TELOMERASE REVERSE TRANSCRIPTASE
TRANSCRIPTIONAL REGULATORY SEQUENCES 

PRIORITY CLAIM

This application claims the priority basis of U.S. Patent Application 09/244,438. For purposes of 
prosecution in the U.S., the priority application is hereby incorporated herein by reference in its entirety. 

FIELD OF THE INVENTION

The invention is related generally to the fields of genetic regulatory elements that control protein 
transcription in eukaryotic cells, and recombinant viral constructs useful for the treatment of disease, including
cancer. More specifically, the invention describes promoters based on regulatory elements for telomerase
reverse transcriptase, transcriptional control sequences, and the use of these features in the design of oncolytic
viruses. 

BACKGROUND OF THE INVENTION 

It has long been recognized that complete replication of the ends of eukaryotic chromosomes requires
specialized cell components (Watson (1972) Nature New Biol. 239:197; Olovnikov (1973) J. Theor. Biol.
41:181). Replication of a linear DNA strand by conventional DNA polymerases requires an RNA primer, and
can proceed only 5' to 3'. When the RNA primer bound at the extreme 5' ends of eukaryotic chromosomal DNA
strands is removed, a gap is introduced, leading to a progressive shortening of daughter strands with each
round of replication. This shortening of telomeres, the protein-DNA structures physically located on the ends of
chromosomes, is thought to account for the phenomenon of cellular senescence or aging of normal human
somatic cells in vitro and in vivo (Goldstein (1990) Science 249:1129; Martin (1979) Lab. Invest. 23:86;
Goldstein (1969) Proc. Natl. Acad. Sci. USA 64:155; Schneider (1976) Proc. Natl. Acad. Sci. USA 73:3584;
Harley (1990) Nature 345:458-460; Hastie (1990) Nature 346:866-868; Counter (1992) EMBO J. 11:1921-1929;
Bodnar (1998) Science 279:349-52).

The length and integrity of telomeres is thus related to entry of a cell into a senescent stage.
Moreover, the ability of a cell to maintain (or increase) telomere length may allow a cell to escape senescence.

The maintenance of telomeres is a function of a specific DNA polymerase known as telomerase
reverse transcriptase (TERT). Telomerase is a ribonucleoprotein (RNP) that uses a portion of its RNA moiety
as a template for telomere repeat DNA synthesis (Morin (1997) Eur. J. Cancer 33:750). Consistent with the
relationship of telomeres and TERT to the proliferative capacity of a cell, telomerase activity can be detected in
highly replicative cell types such as stem cells. It is also active in an extraordinarily diverse set of tumor
tissues, but is not active in normal somatic cell cultures or normal tissues adjacent to a tumor (U.S. Patent Nos.
5,629,154; 5,489,508; 5,648,215; and 5,639,613; Morin (1989) Cell 59:521; Shay (1997) Eur. J. Cancer 33:787;
Kim (1994) Science 266:2011). Moreover, a correlation between the level of telomerase activity in a tumor and
the likely clinical outcome of the patient has been reported (U.S. Patent No. 5,639,613; Langford (1997) Hum.
Pathol. 28:416).

Telomerase activity has also been detected in human germ cells, proliferating stem or progenitor cells, 
and activated lymphocytes. In somatic stem or progenitor cells, and in activated lymphocytes, telomerase 
activity is typically either very low or only transiently expressed (Chiu (1996) Stem Cells 14:239; Bodnar (1996)
Exp. Cell Res. 228:58; Taylor (1996) J. Invest. Dermatol. 106:759). 

The preceding summary is intended to introduce the field of the present invention to the reader. The 
cited references in this application are not to be construed as admitted prior art. 



SUMMARY OF THE INVENTION 

This disclosure explains that telomerase reverse transcriptase (TERT) is an ideal target for treating 
human diseases relating to cellular proliferation and senescence, such as cancer. The cis-acting transcriptional 
control elements of this invention enable identification of trans-acting transcription control factors. The
discovery and characterization of a promoter specific for TERT expressing cells has provided an opportunity to 
develop important new disease therapies. 

An embodiment of the invention is an isolated, synthetic, or recombinant polynucleotide comprising a 
promoter sequence. A desirable feature of the promoter is that it preferentially promotes transcription of the 
genetic element in cells expressing TERT, such as cancer cells and other cells that can undergo extensive
replication, such as stem cells. In some cases, the promoter sequence comprises about 15, 50, 100, 150, 200,
250, 500, 1000, 2500 or 13,000 bases in SEQ ID NO:1 or SEQ ID NO:2, or a nucleic acid molecule that
hybridizes to such a portion of SEQ ID NO:1 or SEQ ID NO:2 under stringent conditions. Prototype promoter 
polynucleotides are human telomerase reverse transcriptase (hTERT) promoter or a mouse telomerase reverse 
transcriptase (mTERT) promoter, and variants thereof with the desired cell specificity, such as may be 
determined according to the reporter assays provided in this invention. In some cases, the promoter is distinct 
from SEQ. ID NO:6 of WO 98/14593 (hTERT), or SEQ. ID NO:5 of WO 99/27113 (mTERT), by virtue of
sequence variation or increased length in the promoter region. Any feature of upstream or intron sequence that 
affects the rate of transcription in a particular cell can affect performance of the promoter. 

A number of exemplary recombinant plasmids are provided that have the characteristic of 
preferentially promoting transcription in cells expressing TERT. One example (pGRN175 or phTERT175) is a 
promoter from position -117 to position -36, numbered from the translation initiation site (base 13545) of SEQ. 
ID NO:1 - i.e., bases 13428-13509 of SEQ. ID NO:1. Another example (pGRN176 or phTERT176) is a
promoter from position -239 to position -36, numbered from the translation initiation site (base 13545) of SEQ.
ID NO:1 - i.e., bases 13306-13509 of SEQ. ID NO:1. Other examples include pGRN316, a promoter from
position -239 to +1 (bases 13306-13545 of SEQ. ID NO:1) and pGRN350, a promoter from position -117 to +1
(bases 13428-13545 of SEQ. ID NO:1). Thus, preferential promotion in cells expressing TERT can be attained 
with a minimal promoter that is no longer than about 82 bases in length. 
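
The relationship between positions numbered from the translation initiation site and base numbers of SEQ. ID NO:1 can be checked with a short calculation. The following is a minimal sketch only. The function and variable names are hypothetical, and it assumes that the numbering has no position 0, so that +1 is base 13545 and -1 is base 13544. This assumption is consistent with the base ranges recited above but is not stated explicitly in the text.

```python
# Illustrative sketch; names are hypothetical. The only values taken from the
# text are the translation initiation site (base 13545 of SEQ. ID NO:1) and
# the end points of the four exemplary constructs.
ATG_BASE = 13545  # position +1 in the numbering used in the text


def to_seq_base(position: int) -> int:
    """Map a position numbered from the translation initiation site
    to a base number of SEQ. ID NO:1 (assumes there is no position 0)."""
    if position == 0:
        raise ValueError("this numbering has no position 0")
    # Negative positions lie upstream of the ATG; +1 is the A of the ATG.
    return ATG_BASE + position if position < 0 else ATG_BASE + position - 1


# (start, end) positions as recited for each plasmid
CONSTRUCTS = {
    "pGRN175": (-117, -36),
    "pGRN176": (-239, -36),
    "pGRN316": (-239, 1),
    "pGRN350": (-117, 1),
}


def describe(name: str) -> tuple[int, int, int]:
    """Return (first base, last base, length in bases) for a construct."""
    start, end = CONSTRUCTS[name]
    first, last = to_seq_base(start), to_seq_base(end)
    return first, last, last - first + 1


for name in CONSTRUCTS:
    first, last, length = describe(name)
    print(f"{name}: bases {first}-{last} of SEQ. ID NO:1 ({length} bases)")
```

Under these assumptions the span recited for pGRN175 (bases 13428-13509) works out to 82 bases, which agrees with the minimal promoter length stated above.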

Transcriptional regulatory sequences have been discovered within the promoters of this invention, 
which provide methods for regulating transcription. In another embodiment of the invention, transcription of an 
encoding region under control of a promoter is regulated by modulating a transcriptional regulatory element 
within the promoter. The transcriptional regulatory element is modulated by a factor that binds the regulatory 
sequence, exemplified by SP1, SRY, HNF-3β, HNF-5, TFIID-MBP, E2F, c-Myb, and particularly c-Myc, which
(as shown in Example 8) can in some circumstances be modulated using a ligand for the estrogen receptor. 
Since c-Myc binds to a regulatory sequence known as an E box, another embodiment of the invention is a 
method for expressing a polynucleotide in a cell, comprising transducing the cell with a vector in which the 
polynucleotide is operably linked to an hTERT promoter comprising an E box, and then treating the cell to
increase binding of a transcriptional regulatory factor such as c-Myc to the E box. The invention also provides a 
method for identifying such transcriptional regulatory sequences and trans-acting factors. 
Another embodiment of this invention is a promoter that preferentially promotes transcription in TERT
expressing cells, operably linked to an encoding sequence — for example, an encoding region for TERT, or an 
encoding region that is heterologous to the promoter, operably linked by way of genetic recombination. The 
encoded protein can be of any nature. In one example, the encoded protein can be a toxin, or a protein like 
Herpes virus thymidine kinase that renders a cell more susceptible to toxic effects of a drug. Other suitable
toxins are given later in the disclosure. In another example, the encoded protein can be a reporter protein
detectable by a signal such as fluorescence, phosphorescence, or enzymatic activity. 

An embodiment of this invention of particular interest is an oncolytic virus having a genome in which a 
promoter is operably linked to a genetic element essential for replication of the virus. This includes genes
involved in any stage of the replicative cycle, including replication of the genome, assembly of intact viral
particles, and any other critical step. The promoter preferentially promotes transcription of the genetic element 
in cells expressing TERT, thereby promoting replication of the virus. Replication of the virus in a cancer cell 
leads to lysis of the cancer cell. In general, oncolytic viruses are useful for treatment of any disease associated 
with expression of TERT in cells at the disease site. 

Replication-conditional viruses of this invention include but are not limited to adenovirus of any
subtype, wherein the adenovirus E1a region is placed under control of a promoter of this invention. Since a
wide variety of cancer cells and some other types of hyperplasias overexpress TERT, oncolytic adenovirus 
replicates in affected cells, leading to their eradication. It is readily appreciated that other aspects of this 
invention can be incorporated into oncolytic viruses, such as an encoding region for a toxin or other protein
that would compromise viability of the cancer cell. The viruses are selected by using candidate oncoviruses to
infect a cell or a plurality of cells expressing TERT and not expressing TERT, and then choosing candidates on 
the basis of whether they preferentially kill the cells expressing TERT. 

Other embodiments of the invention are polynucleotide sequence fragments obtained upstream from 
the hTERT encoding region, variants, homologs, and hybridizing polynucleotides. These products are of
interest in part for cis-acting regulatory functions of transcription, including not only promoter activity, but also
repressor activity, the binding of trans-acting regulatory factors, and other functions described in the disclosure. 
Further embodiments of this invention include cells and organisms introduced with the polynucleotides, vectors, 
and viruses of this invention; methods of treating medical conditions associated with elevated TERT 
expression, and pharmaceutical compositions for the treatment of such conditions. 

A further understanding of the nature and advantages of the invention will be appreciated from the
disclosure that follows.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a restriction map of lambda phage clone λGΦ5, used for obtaining the sequence about
15 kbases upstream from the translation initiation site. This region includes the hTERT promoter.

Figure 2 is a map showing features of an hTERT promoter-reporter plasmid. Reporter plasmids have
been used to demonstrate that the promoter specifically promotes transcription in cells expressing TERT, 
including cancer cells. 

Figure 3 is a sequence alignment, comparing regions of the hTERT promoter (SEQ. ID NO:1) with
that of mTERT (SEQ. ID NO:2). Regions of homology were used to identify regulatory elements. Figure 3(A)
shows the position of conserved cis-acting transcriptional regulatory motifs, including the E-box (the Myc/Max 
binding site, indicated by shading) and the SP1 sites (underlined). The lower panel illustrates the proximal 
sequences of the 2.5 kb hTERT and E-box reporter constructs, including the region deleted in the E-box 
reporter construct, as described in Example 8. Figure 3(B) shows the identification of other regulatory
elements. The numbering shown is calculated from the translation initiation site. 

Figure 4 is a half tone reproduction of cell lines photographed 7 days after infection with oncolytic 
virus. Top row: uninfected cells (negative control). Middle row: cells infected with oncolytic adenovirus, in 
which replication gene E1a is operably linked to the hTERT promoter. Bottom row: cells infected with 
adenovirus in which E1a is operably linked to the CMV promoter (positive control). 

The cells tested were as follows: Figure 4(A): BJ (foreskin fibroblast); IMR-90 (lung fibroblast); WI-38 
(lung fibroblast); cells of non-malignant origin. Figure 4(B): A549 (lung carcinoma); AsPC-1 and BxPC-3
(adenocarcinoma, pancreas). Figure 4(C): DAOY (medulloblastoma); HeLa (cervical carcinoma); HT1080 
(fibrosarcoma). The results show that the hTERT-regulated oncolytic virus specifically lyses cancer cells, in 
preference to cell lines that don't express telomerase reverse transcriptase at a substantial level. This is in 
contrast to oncolytic virus regulated by a constitutive promoter like CMV promoter, which lyses cells non- 
specifically. 

Figure 5 is a series of maps showing construction of oncolytic adenovirus, made conditionally 
replicative by placing the E1a replication gene under control of an hTERT promoter. The first construct comprises
the Inverted Terminal Repeat (ITR) from the adenovirus (Ad2); followed by the hTERT medium-length promoter
(pGRN176) operably linked to the adenovirus E1a region; followed by the rest of the adenovirus deleted for the
E3 region (ΔE3). This construct was used in the virus infection experiments shown in Figure 4. The second
conditionally replicative adenovirus construct shown in the Figure comprises an additional sequence in between 
the hTERT promoter and the E1a region. The HI sequence is an artificial intron engineered from adenovirus 
and immunoglobulin intron splice sequences. The third adenovirus construct is similar, except that the E1a 
region used is longer at the 5' end by 51 nucleotides. 



DETAILED DESCRIPTION 

The invention provides novel isolated polynucleotides comprising cis-acting transcriptional control 
sequences of telomerase reverse transcriptase genes. The polynucleotides of the invention include those 
based on or derived from genomic sequences of untranscribed, transcribed and intron regions of TERT genes,
including the human and mouse homologs. Cis-acting TERT transcriptional control sequences include those
that regulate and modulate timing and rates of transcription of the TERT gene. The TERT promoter sequences 
of the invention include cis-acting elements such as promoters, enhancers, repressors, and polynucleotide 
sequences that can bind factors that influence transcription. 

Isolating and characterizing human TERT promoter sequence

As described in Example 1, the hTERT promoter (SEQ ID NO:1) was obtained by sequencing an 
insert from a lambda phage isolated from a human genomic library. This lambda clone is designated λGΦ5 and
has been deposited at the ATCC, under Accession No. 98505. Lambda GΦ5 contains a 15.3 kilobase pair
(kbp) insert including approximately 13,500 bases upstream from the hTERT coding sequence. These hTERT
promoter sequences were further subcloned into plasmids. A Not1 fragment (SEQ ID NO:1) from λGΦ5
containing the hTERT promoter sequences was subcloned in opposite orientations into the Not1 site of pUC
derived plasmids (designated pGRN142 and pGRN143, respectively), and pGRN142 was sequenced.

In SEQ ID NO:1, the hTERT genomic insert begins at residue 44 and ends at residue 15375. The 
start of the cDNA from which it was derived begins at residue 13490. The hTERT ATG translation initiation 
codon starts at residue 13545. Untranscribed hTERT promoter sequences lie downstream of residue 44 and 
upstream of the encoding region, and may also reside in the first intron. In immortal cells, a reporter gene
driven by a sequence upstream of the TERT coding sequence drove expression as efficiently as the positive
control (containing an SV40 early promoter and enhancer). Certain TERT promoter sequences of the invention
also include intron sequences.

Identification of cis-acting transcriptional regulatory sequences in the human and mouse TERT promoter

To identify cis-acting transcriptional regulatory sequences in human TERT and mouse TERT 
sequences 5' to their respective TERT coding sequence, the human and mouse promoter sequences were 
analyzed for sequence identity. Alignment of the first 300 bases upstream of the human and mouse coding 
sequences indicated a number of conserved regions, and putative cis-acting transcriptional regulatory 
sequences were identified (Figure 3(A)). 

In particular, located at residues -34 to -29 upstream of the human TERT translation start site (ATG, A 
at 13545 of SEQ ID NO: 1) and at residues -32 to -27 upstream of the mouse TERT translation start site (ATG) 
are highly conserved motifs. They correspond to a cis-acting motif known to interact with c-Myc, the so-called 
"E-box" or "Myc/Max binding site." Specifically, they are highly conserved with respect to the core nucleotides 
that comprise the E-box, nucleotides flanking the E-box and position of the E-box relative to the translation start 
site. A second E-box was identified at residues -242 to -237 upstream of the human TERT translation start site. 
This second E-box was not conserved in the mouse promoter. These observations support the finding that the 
conserved Myc binding site, by interacting with c-Myc as a trans-acting transcriptional regulatory factor, plays a 
major role in TERT promoter regulation and telomerase expression. 
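
By way of illustration only, the location of an E-box relative to a translation start site can be found by a simple motif scan. The sketch below assumes the canonical Myc/Max E-box core CACGTG, which is general knowledge and is not recited in the passage above. The function name and the toy sequence are hypothetical and are not taken from SEQ ID NO:1 or SEQ ID NO:2.

```python
import re

# Assumption: canonical Myc/Max E-box core. The passage above gives only the
# positions of the E-boxes, not their sequence.
E_BOX = "CACGTG"


def find_e_boxes(upstream: str) -> list[tuple[int, int]]:
    """Return (first, last) positions of each E-box in `upstream`.

    `upstream` is the sequence immediately 5' of the ATG, written 5' to 3',
    so its final base is position -1 and its first base is -len(upstream).
    """
    seq = upstream.upper()
    n = len(seq)
    hits = []
    for match in re.finditer(E_BOX, seq):
        first = match.start() - n          # string index 0 maps to position -n
        last = first + len(E_BOX) - 1      # a six-base motif spans six positions
        hits.append((first, last))
    return hits


# Toy sequence (invented): 10 filler bases, one E-box, then 28 filler bases
# before the ATG, which places the motif at -34 to -29.
toy_upstream = "A" * 10 + E_BOX + "T" * 28
print(find_e_boxes(toy_upstream))
```

A scan of this kind reports only candidate sites. Whether a given site is functional is established by the reporter and deletion experiments described in the Examples.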

Sequence alignment identified additional conserved cis-acting transcriptional regulatory elements in 
the TERT gene promoter. For example, two SP1 binding sites, located at residues -168 to -159 and residues
-133 to -121 relative to the TERT translation start site, were identified, which are highly conserved between the
mouse and human TERT promoters. Binding sites (cis-acting sequences) for a number of other transcription
factors, including the sex determining region Y gene product (SRY), hepatic nuclear factors 3-β and 5, TFIID-MBP,
E2F and c-Myb, were also found within this region of both the mouse and human promoters.

Further analysis of the human and mouse TERT promoter sequences indicated other regions of 
sequence conservation. In particular, a region with a high degree of sequence identity between human and 
mouse promoter was found between residue -1106 and residue -1602 upstream of the human TERT 
translation start site and residue -916 and residue -1340 upstream of the mouse TERT translation start site 
(Figure 3(B)). Thus, the invention provides cis-acting sequences specific for the modulation of TERT 
transcription. In a preferred embodiment, the methods of the invention use these human and mouse TERT- 
specific transcriptional regulatory motifs to identify and isolate TERT-specific, and other, trans-acting 
transcriptional regulatory factors. 

The invention also provides the reagents and methods for screening and isolating trans-acting TERT 
transcriptional regulatory factors. Alternative embodiments include novel in vitro and cell-based in vivo assay 
systems to screen for TERT promoter binding agents (trans-acting TERT transcriptional regulatory factors) 
using the nucleic acids of the invention. 

c-Myc is a potent activator of TERT gene transcription

Use of recombinant constructs comprising TERT promoter sequences of the invention has, for the first 
time, demonstrated that c-Myc acts as a potent activator of telomerase activity by direct interaction with cis- 
acting regulatory sequences in the TERT promoter. c-Myc acts through the rapid up-regulation of hTERT gene 
expression (Example 8). Significantly, the studies demonstrate that transcriptional activation of the hTERT 
promoter by c-Myc can be abrogated by deletion or mutation of a single cis-acting regulatory sequence, the
"Myc/Max binding site," within the hTERT promoter. Furthermore, the ability of an inducible c-Myc to enhance 
expression of hTERT is resistant to inhibition of protein synthesis. 

TERT promoter used to drive heterologous gene sequences

The invention also provides constructs in which the TERT promoter sequences of the invention are 
operably linked to a heterologous gene (in a preferred embodiment, a structural gene). In this way the 
heterologous gene is transcribed in the same cells at the same time the natural TERT transcript would be 
expressed. Thus, when the construct is expressed in a transformed cell or transgenic (non-human) animal, the 
heterologous gene (and protein, if the gene is a coding sequence) is expressed in the same temporal pattern 
over the same cell range as the wild type, TERT promoter-driven TERT gene. 

These constructs are useful for TERT promoter-based assays, for example, to identify biological 
modulators of TERT and telomerase activity. In alternative embodiments, the heterologous coding sequence
operably linked to a TERT promoter of the invention is a marker gene (e.g., alkaline phosphatase, SEAP;
β-galactosidase), a modified TERT structural gene or a TERT antisense, or a therapeutic gene (e.g., a cytotoxic
gene such as thymidine kinase).

In a further embodiment, cytopathic viruses are provided, in particular human cytopathic viruses, such 
as modified adenovirus or Herpes virus. Viruses, such as adenovirus or Herpes virus, require essential virally
encoded genes to proliferate and lyse specific cells. If any one of these essential viral genes were modified 
such that expression of the essential element would be driven by the TERT promoter, proliferation of the virus, 
and its cytopathic effects, would be restricted to telomerase-expressing cells, in particular tumor cells. 

Definitions 

The following terms are defined infra to provide additional guidance to one of skill in the practice of the 
invention. 

The term "amplifying" as used herein incorporates its common usage and refers to the use of any 
suitable amplification methodology for generating or detecting recombinant or naturally expressed nucleic acid. 
For example, the invention provides methods and reagents (including specific oligonucleotide PCR primer 
pairs) for amplifying naturally expressed or recombinant nucleic acids of the invention in vivo or in vitro. An 
indication that two polynucleotides are "substantially identical" can be obtained by amplifying one of the 
polynucleotides with a pair of oligonucleotide primers or pool of degenerate primers (e.g., fragments of a
TERT promoter sequence) and then using the product as a probe under stringent hybridization conditions to 
isolate the second sequence (e.g., the TERT promoter sequence) from a genomic library or to identify the 
second sequence in a Northern or Southern blot. 

As used herein, the term "TERT promoter" includes any TERT genomic sequences capable of driving 
transcription in telomerase activity positive cells. Thus, TERT promoters of the invention include without 
limitation cis-acting transcriptional control elements and regulatory sequences that are involved in regulating or
modulating the timing and/or rate of transcription of a TERT gene. For example, the TERT promoter of the 
invention comprises cis-acting transcriptional control elements, including enhancers, promoters, transcription 
terminators, origins of replication, chromosomal integration sequences, 5' and 3' untranslated regions, exons
and introns, which are involved in transcriptional regulation. These cis-acting sequences typically interact with 
proteins or other biomolecules to carry out (turn on/off, regulate, modulate, etc.) transcription. 

One of skill in the art will appreciate that the hTERT and mTERT promoter sequences provided herein 
are exemplary only, and that they may be used as a basis to produce numerous versions of TERT promoters, 
i.e., promoters that are capable of driving transcription in telomerase activity positive cells. For example, while
it is shown herein that a sequence comprising 2447 nucleotides of the disclosed hTERT promoter can drive 
expression in this manner (pGRN350), one of skill in the art will appreciate that such activity may be obtained 
using longer or shorter promoter sequences. Furthermore, one of skill in the art will appreciate that promoter 
sequences that vary from those sequences provided herein by, for example, nucleotide additions, deletions or 
substitutions may also be used to obtain expression in telomerase activity positive cells. Such variants will 
share a specified minimum level of structural (sequence) similarity to the disclosed TERT promoter sequences, 
which similarity may be defined in terms of either sequence identity to the disclosed TERT promoter 
sequences, or the ability to hybridize to the disclosed sequences at specified levels of hybridization stringency. 
For example, variant TERT promoters include promoters that hybridize to the TERT promoters disclosed herein 
(at 37°C in a buffer of 40% formamide, 1 M NaCl, and 1% SDS, followed by a wash in 1X SSC at 45°C), and
which are capable of driving transcription in telomerase activity positive cells. Other variant TERT promoters 
include promoters that share at least about 80%, 90%, 95%, 98% or 100% sequence identity with the
disclosed TERT promoters. Sequence identity is calculated by first aligning the polynucleotide being examined 
with the reference counterpart, and then counting the number of residues shared between the sequences being 
compared as a percentage of the region under examination. No penalty is imposed for the presence of 
insertions or deletions, but insertions or deletions are permitted only where clearly required to readjust the 
alignment. The percentage is given in terms of residues in the sequence being examined that are identical to 
residues in the comparison or reference sequence. 
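
The sequence identity calculation described above may be sketched as follows. This is one possible reading and not a definitive implementation. It assumes that the two sequences have already been aligned to equal length with "-" marking insertions or deletions, and that "no penalty" means alignment columns containing a gap in either sequence are left out of both the count and the denominator. The function name and the example sequences are hypothetical.

```python
def percent_identity(examined: str, reference: str, gap: str = "-") -> float:
    """Percentage of aligned residues in `examined` identical to `reference`.

    Both arguments are rows of a pre-computed pairwise alignment. Columns with
    a gap in either row are skipped (assumed reading of "no penalty").
    """
    if len(examined) != len(reference):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(examined.upper(), reference.upper()):
        if a == gap or b == gap:
            continue  # insertion or deletion: neither counted nor penalized
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * identical / compared


# Invented example: one gap column and one mismatch over ten alignment columns
candidate = "ACGT-ACGTA"
disclosed = "ACGTTACCTA"
identity = percent_identity(candidate, disclosed)
print(f"{identity:.1f}% identity; meets 80% threshold: {identity >= 80.0}")
```

A different treatment of gap columns, for example counting every residue of the examined sequence in the denominator, would give a lower figure for the same alignment, so the convention used should be stated whenever a percentage is reported.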

The determination that a promoter is capable of driving transcription in telomerase activity positive 
cells can be routinely performed as described in Examples 2 and 5. Briefly, the promoter to be tested is 
operably linked to a coding region that encodes a detectable protein such as alkaline phosphatase or green
fluorescent protein. This construct is then introduced into telomerase activity positive (TAP) and telomerase 
activity negative (TAN) cells. Detection of the detectable protein in the TAP cells but not in the TAN cells, or of 
an elevated level of the detectable protein in the TAP compared to the TAN cells (preferably at least a three- 
fold difference) indicates that the promoter is a TERT promoter. 

A promoter is said to "preferentially promote transcription" in a cell having a particular phenotype if the 
level of transcription is at least about 3-fold higher in cells of that phenotype than cells that lack the phenotype. 
Promoters of this invention preferentially promote transcription in cells expressing TERT, including diseased 
cells where the disease is associated with overexpression of TERT, such as cancer. There is preferential 
transcription if the relative increase in cells expressing the stated phenotype is at least about 3-fold, 10-fold, 30- 
fold or 100-fold higher compared with cells that don't have the phenotype, in order of increasing preference. 
Promoters that show lower levels of specificity in an assay where just two types of cells are compared may be 
tested using a larger panel. One skilled in the art will know that TERT positive cells include various types of 
cancer cells, various types of progenitor cells and stem cells, and under certain conditions, B and T 
lymphocytes. Suitable negative controls include primary cultures and established cell lines of mature 
differentiated cells of most tissue types. 
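
The fold-difference criterion described above can be expressed as a short calculation on reporter signals measured in telomerase activity positive (TAP) and telomerase activity negative (TAN) cells. The sketch below is illustrative only. The function names and the handling of a zero TAN signal are assumptions, "at least about" is treated as a plain greater-than-or-equal comparison, and the signal values in the example are invented.

```python
import math

# Fold thresholds named in the text, highest (most preferred) first
FOLD_TIERS = (100, 30, 10, 3)


def fold_preference(tap_signal: float, tan_signal: float) -> float:
    """Ratio of reporter signal in TAP cells to that in TAN cells.

    Assumption: background-corrected, non-negative signals. Signal in TAP
    cells with none in TAN cells is reported as infinite preference.
    """
    if tap_signal < 0 or tan_signal < 0:
        raise ValueError("signals must be non-negative")
    if tan_signal == 0:
        return math.inf if tap_signal > 0 else 0.0
    return tap_signal / tan_signal


def preference_tier(fold: float):
    """Highest threshold met (100, 30, 10 or 3), or None if below 3-fold."""
    for tier in FOLD_TIERS:
        if fold >= tier:
            return tier
    return None


# Invented reporter readings in arbitrary units
fold = fold_preference(tap_signal=45.0, tan_signal=1.5)
tier = preference_tier(fold)
print(f"{fold:.1f}-fold; highest threshold met: {tier}")
```

Where a comparison of only two cell types falls below the 3-fold threshold, the same calculation can be repeated across a larger panel of TAP and TAN cells, as noted above.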

In alternative embodiments, the TERT promoter sequence comprises TERT sequences upstream of 
the translation start site (ATG). For example, in one embodiment, the hTERT promoter comprises residues 44 to
13545 of SEQ ID NO:1. Other embodiments include sequences starting within about one to 5 nucleotides of a 
translation start codon (for example in SEQ ID NO:1 or SEQ ID NO:2) and ending at about 50, 100, 150, 200, 
250, 500, 1000, 2500 or 13500 nucleotides upstream of the translation start codon. Such embodiments can 
optionally include other regulatory sequences, such as, exon and/or intron sequences. Another embodiment 
includes TERT intron sequences with regulatory activity, as described in Example 2. hTERT promoters of the 
invention also include sequences substantially identical (as defined herein) to an exemplary hTERT promoter
sequence of the invention, having the sequence set forth by SEQ ID NO:1. Similarly, mTERT promoters of the 
invention also include sequences substantially identical to an exemplary mTERT promoter sequence of the 
invention, having the sequence set forth by SEQ ID NO:2. 

The term "heterologous" when used with reference to portions of a nucleic acid, indicates that the 
nucleic acid comprises two or more subsequences which are not found in the same relationship to each other in 
nature. For instance, the nucleic acid is typically recombinantly produced, having two or more sequences from 
unrelated genes arranged in a manner not found in nature; such as a promoter sequence of the invention 
operably linked to a polypeptide coding sequence that, when operably linked, does not reform the naturally 
occurring TERT gene. For example, the invention provides recombinant constructs (expression cassettes, 
vectors, viruses, and the like) comprising various combinations of promoters of the invention, or subsequences 
thereof, and heterologous coding sequences. 

As used herein, "isolated," when referring to a molecule or composition, such as an hTERT promoter 
sequence, means that the molecule or composition is separated from at least one other compound, such as a 
protein, DNA, RNA, or other contaminants with which it is associated in vivo or in its naturally occurring state. 
Thus, a nucleic acid sequence is considered isolated when it has been isolated from any other component with 
which it is naturally associated. An isolated composition can, however, also be substantially pure. An isolated 
composition can be in a homogeneous state. It can be in a dry or an aqueous solution. Purity and 
homogeneity can be determined by analytical chemistry techniques such as polyacrylamide gel electrophoresis 
(PAGE), agarose gel electrophoresis or high pressure liquid chromatography (HPLC). 

As used herein, the terms "nucleic acid" and "polynucleotide" are used interchangeably, and include 
oligonucleotides. They also refer to synthetic and/or non-naturally occurring nucleic acids (including nucleic 
acid analogues or modified backbone residues or linkages). The terms also refer to deoxyribonucleotide or 
ribonucleotide oligonucleotides in either single- or double-stranded form. The terms encompass nucleic acids
containing known analogues of natural nucleotides. The term also encompasses nucleic acid-like structures 
with synthetic backbones. DNA backbone analogues provided by the invention include phosphodiester, 
phosphorothioate, phosphorodithioate, methyl-phosphonate, phosphoramidate, alkyl phosphotriester,
sulfamate, 3'-thioacetal, methylene(methylimino), 3'-N-carbamate, morpholino carbamate, and peptide nucleic
acids (PNAs); see Oligonucleotides and Analogues, a Practical Approach, edited by F. Eckstein, IRL Press at 
Oxford University Press (1991); Antisense Strategies, Annals of the New York Academy of Sciences, Volume 
600, Eds. Baserga and Denhardt (NYAS 1992); Milligan (1993) J. Med. Chem. 36:1923-1937; Antisense 
Research and Applications (1993, CRC Press). PNAs contain non-ionic backbones, such as N-(2-aminoethyl) 
glycine units. Phosphorothioate linkages are described in WO 97/03211; WO 96/39154; Mata (1997) Toxicol. 
Appl. Pharmacol. 144:189-197. Other synthetic backbones encompassed by the term include methyl- 
phosphonate linkages or alternating methylphosphonate and phosphodiester linkages (Strauss-Soukup (1997) 
Biochemistry 36:8692-8698), and benzyl-phosphonate linkages (Samstag (1996) Antisense Nucleic Acid Drug 
Dev 6:153-156). 

As used herein, the term "operably linked" refers to a functional relationship between two or more 
nucleic acid segments. Typically, it refers to the functional relationship of a transcriptional regulatory sequence 
to a transcribed sequence. For example, a TERT promoter sequence of the invention, including any 
combination of cis-acting transcriptional control elements, is operably linked to a coding sequence if it 
stimulates or modulates the transcription of the coding sequence in an appropriate host cell or other expression 
system. Generally, promoter transcriptional regulatory sequences that are operably linked to a transcribed 
sequence are physically contiguous to the transcribed sequence, i.e., they are cis-acting. However, some 
transcriptional regulatory sequences, such as enhancers, need not be physically contiguous or located in close 
proximity to the coding sequences whose transcription they enhance. 

As used herein, "recombinant" refers to a polynucleotide synthesized or otherwise manipulated in vitro, 
to methods of using recombinant polynucleotides to produce gene products in cells or other biological systems, 
or to a polypeptide ("recombinant protein") encoded by a recombinant polynucleotide. "Recombinant means" 
also encompass the ligation of nucleic acids having coding or promoter sequences from different sources into 
an expression cassette or vector for expression of a fusion protein, or for inducible or constitutive expression of a 
protein (for example, a TERT promoter of the invention operably linked to a heterologous nucleotide, such as a 
polypeptide coding sequence). 

As used herein, the "sequence" of a gene (unless specifically stated otherwise) or nucleic acid refers 
to the order of nucleotides in the polynucleotide, including either or both strands of a double-stranded DNA 
molecule — the sequence of both the coding strand and its complement, or of a single-stranded nucleic acid 
molecule. For example, in alternative embodiments, the promoter of the invention comprises untranscribed, 
untranslated, and intron TERT sequences, as set forth in the exemplary SEQ ID NO:1 and SEQ ID NO:2. 

As used herein, the term "transcribable sequence" refers to any sequence which, when operably 
linked to a cis-acting transcriptional control element, such as the TERT promoters of the invention, and when 
placed in the appropriate conditions, is capable of being transcribed to generate RNA. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide 
sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage 
of nucleotides (or amino acid residues) that are the same, when compared and aligned for maximum 
correspondence over a comparison window, as measured using one of the following sequence comparison 
algorithms or by manual alignment and visual inspection. This definition also refers to the complement of a 
sequence. For example, in alternative embodiments, nucleic acids within the scope of the invention include 
those with a nucleotide sequence identity that is at least about 60%, at least about 75-80%, about 90%, and 
about 95% of the exemplary TERT promoter sequence set forth in SEQ ID NO:1 (including residues 44 to 
13544 of SEQ ID NO:1) or SEQ ID NO:2, and the intron TERT sequences capable of driving a reporter gene in 
telomerase positive cells. Two sequences with these levels of identity are "substantially identical." Thus, if a 
sequence has the requisite sequence identity to a TERT promoter sequence or subsequence of the invention, it 
also is a TERT promoter sequence within the scope of the invention. Preferably, the percent identity exists 
over a region of the sequence that is at least about 25 nucleotides in length, more preferably over a region that 
is at least about 50-100 nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, to which test 
sequences are compared. When using a sequence comparison algorithm, test and reference sequences are 
entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm 
program parameters are designated. Default program parameters can be used, or alternative parameters can 
be designated. The sequence comparison algorithm then calculates the percent sequence identity for the test 
sequence(s) relative to the reference sequence, based on the designated or default program parameters. A 
"comparison window", as used herein, includes reference to a segment of any one of the number of contiguous 
positions selected from the group consisting of from 25 to 600, usually about 50 to about 200, more usually 
about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of 
contiguous positions after the two sequences are optimally aligned. Alignment of sequences can be conducted 
by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of 
Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these 
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection. 
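
By way of illustration only, the following minimal sketch shows how a global alignment in the manner of Needleman & Wunsch can be used to compute a percent identity between a test sequence and a reference sequence. The scoring values (match, mismatch, gap) and the function names are assumptions chosen for the example and are not parameters prescribed by this disclosure; the identity is computed here over the full alignment length, which is only one of several conventions in use.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align strings a and b; return the two gapped strings.

    Scoring values are illustrative assumptions, not prescribed parameters.
    """
    n, m = len(a), len(b)
    # score[i][j] = best score aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue in both rows."""
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


ref, test = needleman_wunsch("ACGT", "AGT")
print(ref, test, percent_identity(ref, test))
```

In practice, a comparison window and established software such as the programs named above would be used; the sketch is intended only to make the calculation concrete.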

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a 
group of related sequences using progressive, pair-wise alignments to show relationship and percent sequence 
identity. It also plots a tree or dendrogram, showing the clustering relationships used to create the alignment. 
PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360 
(1987). The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153 (1989). 
The program can align up to 300 sequences, each of a maximum length of 5,000 nucleotides or amino acids. 
The multiple alignment procedure begins with the pair-wise alignment of the two most similar sequences, 
producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or 
cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension of the pair-wise 
alignment of two individual sequences. The final alignment is achieved by a series of progressive, pair-wise 
alignments. The program is run by designating specific sequences and their amino acid or nucleotide 
coordinates for regions of sequence comparison and by designating the program parameters. Using PILEUP, a 
reference sequence is compared to another sequence to determine the percent sequence identity relationship 
(whether the second sequence is substantially identical and within the scope of the invention) using the 
following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. 
PILEUP can be obtained from the GCG sequence analysis software package (Devereaux (1984) Nuc Acids 
Res. 12:387-395). 

Another example of an algorithm that is suitable for determining percent sequence identity is the BLAST 
algorithm, which is described in Altschul (1990) J. Mol. Biol. 215:403-410. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by 
identifying short words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul (1990) supra). These initial neighborhood word hits act as seeds 
for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are 
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues, always < 0). For amino acid sequences, a scoring 
matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative 
score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or 
the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity 
and speed of the alignment. In one embodiment, to determine if a nucleic acid sequence is within the scope of 
the invention, the BLASTN program (for nucleotide sequences) is used incorporating as defaults a word-length 
(W) of 11, an expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, 
the BLASTP program uses as default parameters a word-length (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix (Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915). 
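
The extension step described above can be sketched as follows. This is a simplified, ungapped, one-directional illustration and not the BLAST implementation itself: it assumes the seed word is an exact match, extends only to the right, and uses a hypothetical function name. The values M=5 and N=-4 follow the defaults given above; the drop-off value X is an arbitrary assumption.

```python
def extend_right(query, subject, q_start, s_start, word_len, M=5, N=-4, X=20):
    """Extend an exact-match word hit to the right without gaps.

    Returns (best_score, best_length), where best_length counts the seed
    word plus the extension that achieved the maximum cumulative score.
    """
    score = best = M * word_len      # the seed word is assumed to match exactly
    length = best_len = word_len
    i, j = q_start + word_len, s_start + word_len
    while i < len(query) and j < len(subject):   # stop at the end of either sequence
        score += M if query[i] == subject[j] else N
        length += 1
        if score > best:
            best, best_len = score, length
        # stop when the score has dropped by X from its maximum,
        # or when the cumulative score reaches zero or below
        if best - score >= X or score <= 0:
            break
        i += 1
        j += 1
    return best, best_len


print(extend_right("ACGTACGTAAAA", "ACGTACGTCCCC", 0, 0, word_len=4, X=8))
```

A complete implementation would extend the hit in both directions and, for amino acid sequences, would take the per-residue score from a scoring matrix rather than from M and N.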

The BLAST algorithm also performs a statistical analysis of the similarity between two sequences 
(Karlin (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877). One measure of similarity provided by the BLAST 
algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a 
match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic 
acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, duplexing, or hybridizing of 
a molecule to a particular nucleotide sequence under stringent hybridization conditions when that sequence is 
present in a complex mixture (such as total cellular or library DNA or RNA), wherein the particular nucleotide 
sequence is detected at least twice background, preferably 10 times background. In one embodiment, a 
nucleic acid can be determined to be within the scope of the invention according to its ability to hybridize under 
stringent conditions to another nucleic acid (such as the exemplary sequences described herein). 

The phrase "stringent hybridization conditions" refers to conditions under which a probe will primarily 
hybridize to its target subsequence, typically in a complex mixture of nucleic acid, but to no other sequences. 
Stringent conditions are sequence-dependent and will be different in different circumstances, depending on the 
length of the probe. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the 
hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology- 
Hybridization with Nucleic Probes, "Overview of principles of hybridization and the strategy of nucleic acid 
assays" (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the thermal melting 
point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under 
defined ionic strength, pH, and nucleic acid concentration) at which 50% of the probes complementary to the target 
hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of 
the probes are occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other salts) at pH 
7.0 to 8.3 and the temperature is at least about 30°C for short probes (about 10 to about 50 nucleotides) and at least 
about 60°C for long probes (greater than about 50 nucleotides). Stringent conditions may also be achieved 
with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive 
signal (identification of a nucleic acid of the invention) is about 5-10 times background hybridization. "Stringent" 
hybridization conditions that are used to identify substantially identical nucleic acids within the scope of the 
invention include hybridization in a buffer comprising 50% formamide, 5x SSC, and 1% SDS at 42°C, or 
hybridization in a buffer comprising 5x SSC and 1% SDS at 65°C, both with a wash of 0.2x SSC and 0.1% SDS 
at 65°C, for long probes. For short probes, stringent hybridization conditions include hybridization in a buffer 
comprising 50% formamide, 5x SSC and 1% SDS at room temperature or hybridization in a buffer comprising 
5x SSC and 1% SDS at 37°C - 42°C, both with a wash of 0.2x SSC and 0.1% SDS at 37°C - 42°C. However, 
as is apparent to one of ordinary skill in the art, hybridization conditions can be modified depending on 
sequence composition. Moderately stringent hybridization conditions include a hybridization in a buffer of 40% 
formamide, 1 M NaCl, and 1% SDS at 37°C, and a wash in 1x SSC at 45°C. A positive hybridization is at least 
twice background. Those of ordinary skill will readily recognize that alternative hybridization and wash 
conditions can be utilized to provide conditions of similar stringency. 
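
As a rough illustration of the relationship between Tm and stringent conditions stated above (about 5-10°C below Tm), the following sketch estimates Tm for a short probe and derives a corresponding temperature window. The estimate uses the so-called Wallace rule (2°C per A or T, 4°C per G or C), which is an approximation introduced here for the example and is not part of this disclosure; it ignores salt, formamide, and probe concentration and is generally applied only to short oligonucleotides. The function names are hypothetical.

```python
def wallace_tm(probe):
    """Approximate Tm in degrees C for a short oligonucleotide (Wallace rule).

    This is a rule of thumb assumed for illustration, not a method
    prescribed by the text.
    """
    probe = probe.upper()
    at = probe.count("A") + probe.count("T")
    gc = probe.count("G") + probe.count("C")
    return 2 * at + 4 * gc


def stringent_window(tm):
    """Temperature range about 5-10 degrees C below Tm, as (low, high)."""
    return (tm - 10, tm - 5)


tm = wallace_tm("ACGTACGTACGTACGT")
print(tm, stringent_window(tm))
```

For longer probes or buffers containing destabilizing agents such as formamide, empirically validated conditions such as those listed above would be used instead of this estimate.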

General Techniques 

The TERT promoter sequences of the invention and nucleic acids used to practice this invention, 
whether RNA, cDNA, genomic DNA, or hybrids thereof, may be isolated from a variety of sources, genetically 
engineered, amplified, and/or expressed recombinantly. Any recombinant expression system can be used, 
including bacterial, yeast, insect or mammalian systems. Alternatively, these nucleic acids can be chemically 
synthesized in vitro. Techniques for the manipulation of nucleic acids, such as subcloning into expression 
vectors, labeling probes, sequencing, and hybridization are well described in the scientific and patent literature. 
Molecular Cloning: A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989) 
("Sambrook"); Current Protocols In Molecular Biology, Ausubel, ed., John Wiley & Sons, Inc., New York (1997) 
("Ausubel"); Laboratory Techniques In Biochemistry And Molecular Biology: Hybridization With Nucleic Acid 
Probes, Part I. Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993). Nucleic acids can be 
analyzed and quantified by any of a number of techniques, including NMR, spectrophotometry, radiography, 
electrophoresis, capillary electrophoresis, high pressure liquid chromatography (HPLC), thin layer 
chromatography (TLC), and hyperdiffusion chromatography, fluid or gel precipitin reactions, immunodiffusion 
(single or double), immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked immunosorbent assays 
(ELISAs), immuno-fluorescent assays, Southern analysis, Northern analysis, dot-blot analysis, gel 
electrophoresis, RT-PCR, quantitative PCR, other nucleic acid or target or signal amplification methods, 
radiolabeling, scintillation counting, and affinity chromatography. 

Preparing hTERT promoter sequences 

Certain embodiments of the invention are TERT promoters comprising genomic sequences 5' 
(upstream) of an hTERT or mTERT transcriptional start site, and intron sequences. TERT promoters contain 
cis-acting transcriptional regulatory elements involved in TERT message expression. It will be apparent that, in 
addition to the nucleic acid sequences provided in hTERT SEQ ID NO:1 or mTERT SEQ ID NO:2, additional 
TERT promoter sequences can be readily obtained using routine molecular biological techniques. For 
example, additional hTERT genomic (and promoter) sequence can be obtained by screening a human genomic 
library using an hTERT nucleic acid probe having a sequence or subsequence as set forth in SEQ ID NO:1 (a 
nucleic acid sequence is within the scope of the invention if it hybridizes under stringent conditions to an hTERT 
promoter sequence of the invention). Additional hTERT or mTERT genomic sequence can be readily identified 
by "chromosome walking" techniques, as described by Hauser (1998) Plant J 16:117-125; Min (1998) 
Biotechniques 24:398-400. Other useful methods for further characterization of TERT promoter sequences 
include those general methods described by Pang (1997) Biotechniques 22:1046-1048; Gobinda (1993) PCR 
Meth. Applic. 2:318; Triglia (1988) Nucleic Acids Res. 16:8186; Lagerstrom (1991) PCR Methods Applic. 1:111; 
Parker (1991) Nucleic Acids Res. 19:3055. 

In some embodiments, the promoter sequence comprises at least about 15, 50, 100, 150, 200, 250, 
500, 1000, 2500 or 13,000 bases in SEQ ID NO:1 or SEQ ID NO:2. Included is a nucleic acid molecule 
comprising a TERT promoter, including but not limited to hTERT or mTERT, optionally linked to a heterologous 
sequence. The promoter may comprise about 100 to about 200, 200 to about 400, 400 to about 900 or 900 to 
about 2500, or 2500 to about 5000 nucleotides upstream of a translational start site. In other embodiments the 
promoter comprises a sequence that hybridizes with SEQ ID NO:1 or 2. Exemplary are promoter sequences 
that preferentially promote transcription in cells expressing telomerase reverse transcriptase. Such sequences 
can be readily identified using the assays provided elsewhere in this disclosure and in the Examples, in which 
candidate promoter sequences are operably linked to the encoding region for a reporter protein, and then 
transfected into cells with known TERT activity to determine the specificity. 
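
To make the fragment lengths recited above concrete, the following sketch extracts nested candidate promoter fragments lying immediately upstream of a start position in a genomic sequence. The function name, the 0-based start index, and the toy sequence are assumptions for illustration; the actual coordinates would be taken from SEQ ID NO:1 or SEQ ID NO:2.

```python
def upstream_fragments(genomic, start_index, lengths):
    """Return {length: fragment} for fragments ending just before start_index.

    start_index is the 0-based position of the first base of the start site
    (a hypothetical coordinate supplied by the caller). Lengths that would
    run past the 5' end of the available sequence are skipped.
    """
    fragments = {}
    for n in lengths:
        if 0 < n <= start_index:
            fragments[n] = genomic[start_index - n:start_index]
    return fragments


# Toy example: the start codon ATG begins at index 12.
toy = "AAAACCCCGGGGATGTTT"
print(upstream_fragments(toy, toy.index("ATG"), [4, 8, 20]))
```

Each fragment obtained in this way could then be operably linked to a reporter coding sequence and tested as described above.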

The invention provides oligonucleotide primers that can amplify all or any specific region within the 
TERT promoter sequence of the invention, including specific promoter and enhancer subsequences. The 
nucleic acids of the invention can also be generated or measured quantitatively using amplification techniques. 
Using the TERT promoter sequences of the invention (as in the exemplary hTERT SEQ ID NO:1 or mTERT 
SEQ ID NO:2), the skilled artisan can select and design suitable oligonucleotide amplification primers. 
Amplification methods include polymerase chain reaction (PCR Protocols, A Guide To Methods And 
Applications, ed. Innis, Academic Press, N.Y. (1990) and PCR Strategies (1995), ed. Innis, Academic Press, 
Inc., N.Y.); ligase chain reaction (LCR) (Wu (1989) Genomics 4:560; Landegren (1988) Science 241:1077; 
Barringer (1990) Gene 89:117); transcription amplification (Kwoh (1989) Proc. Natl. Acad. Sci. USA, 86:1173); 
self-sustained sequence replication (Guatelli (1990) Proc. Natl. Acad. Sci. USA, 87:1874); Q-β replicase 
amplification (Smith (1997) J. Clin. Microbiol. 35:1477-1491, automated Q-β replicase amplification assay; Burg 
(1996) Mol. Cell. Probes 10:257-271); and other RNA polymerase mediated techniques (NASBA, Cangene, 
Mississauga, Ontario); Berger (1987) Methods Enzymol. 152:307-316, Sambrook, Ausubel, Mullis (1987) U.S. 
Patent Nos. 4,683,195, and 4,683,202; Arnheim (1990) C&EN 36-47; Lomell J. Clin. Chem., 35:1826 (1989); 
Van Brunt (1990) Biotechnology, 8:291-294; Wu (1989) Gene 4:560; Sooknanan (1995) Biotechnology 13:563- 
564. Once amplified, TERT genomic DNA, TERT promoter sequences, and the like, can be cloned, if desired, 
into any of a variety of vectors using routine molecular biological methods; methods for cloning in vitro amplified 
nucleic acids are described in Wallace, U.S. Pat. No. 5,426,039. 
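
The selection of amplification primers for a chosen region can be sketched, under simplifying assumptions, as taking the 5' end of the region as the forward primer and the reverse complement of its 3' end as the reverse primer. The function names and the fixed primer length are hypothetical; a practical design would also weigh melting temperature, GC content, self-complementarity, and specificity, none of which is modeled here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def flanking_primers(template, start, end, primer_len=20):
    """Return (forward, reverse) primers, both 5' to 3', spanning template[start:end].

    Coordinates are 0-based and end-exclusive (an assumption of this sketch).
    """
    region = template[start:end].upper()
    if len(region) < 2 * primer_len:
        raise ValueError("region is too short for two non-overlapping primers")
    forward = region[:primer_len]
    reverse = reverse_complement(region[-primer_len:])
    return forward, reverse


print(flanking_primers("ATGCCCGGGTAC", 0, 12, primer_len=3))
```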

The invention includes TERT promoter sequences that have been modified in a site-specific manner to 
alter, add to, or delete some or all of the promoter's functions. For example, specific base pairs can be 
modified to alter, increase or decrease the binding affinity to trans-acting transcriptional regulatory factors, thus 
modifying the relative level of transcriptional activation or repression. Modifications can also change secondary 
structures of specific subsequences, such as those associated with many cis-acting transcriptional elements. 
Site-specific mutations can be introduced into nucleic acids by a variety of conventional techniques, well 
described in the scientific and patent literature. Illustrative examples include site-directed mutagenesis by 
overlap extension polymerase chain reaction (OE-PCR), as in Urban (1997) Nucleic Acids Res. 25:2227-2228; 
Ke (1997) Nucleic Acids Res 25:3371-3372, and Chattopadhyay (1997) Biotechniques 22:1054-1056, 
describing PCR-based site-directed mutagenesis "megaprimer" method; Bohnsack (1997) Mol. Biotechnol. 
7:181-188; Ailenberg (1997) Biotechniques 22:624-626, describing site-directed mutagenesis using a PCR- 
based staggered re-annealing method without restriction enzymes; Nicolas (1997) Biotechniques 22:430-434, 
site-directed mutagenesis using long primer-unique site elimination and exonuclease III. Modified TERT 
promoter sequences of the invention can be further produced by chemical modification methods. Belousov 
(1997) Nucleic Acids Res. 25:3440-3444; Frenkel (1995) Free Radic. Biol. Med. 19:373-380; Blommers (1994) 
Biochemistry 33:7886-7896. 

The invention also provides antisense oligonucleotides capable of binding TERT promoter regions 
which, at least in part, modulate TERT transcription and telomerase activity. For example, antisense 
oligonucleotides that form triplexes with promoter regions inhibit the activity of that promoter. Joseph (1997) 
Nucleic Acids Res. 25:2182-2188; Alunni-Fabbroni (1996) Biochemistry 35:16361-16369; Olivas (1996) Nucleic 
Acids Res 24:1758-1764. Alternatively, antisense oligonucleotides that hybridize to the promoter sequence can 
be used to inhibit promoter activity. 

For example, antisense polynucleotides of the invention can comprise an antisense sequence of at 
least 7 to 10 to about 20 or more nucleotides that specifically hybridize to a sequence complementary to the 
TERT promoter sequences of the invention. Alternatively, the antisense polynucleotide of the invention can be 
from about 10 to about 50 nucleotides in length or from about 14 to about 35 nucleotides in length. In other 
embodiments, they are less than about 100 nucleotides or less than about 200 nucleotides. In general, the 
antisense polynucleotide should be long enough to form a stable duplex (or triplex) but, if desired, short 
enough, depending on the mode of delivery, to be administered in vivo. The minimum length of a 
polynucleotide required for specific hybridization to a target sequence depends on several factors, such as G/C 
content, positioning of mismatched bases (if any), degree of uniqueness of the sequence as compared to the 
population of target polynucleotides, and chemical nature of the nucleotides used in the antisense reagent 
(methylphosphonate backbone, peptide nucleic acid, phosphorothioate), among other factors. Methods relating 
to antisense polynucleotides are also described in Antisense RNA And DNA (1988), D.A. Melton, Ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY; Dagle (1991) Nucleic Acids Research 19:1805; Kim (1998) 
J. Controlled Release 53:175-182; for antisense therapy, Uhlmann (1990) Chem. Reviews 90:543-584; Poston 
(1998) J. Thorac. Cardiovasc. Surg. 116:386-396 (ex vivo gene therapy); Haller (1998) Kidney Int. 53:1550- 
1558; Nguyen (1998) Cancer Res 58:5673-7. 
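
As an illustration of how length and G/C content might enter into the choice of an antisense oligonucleotide, the following sketch enumerates fixed-length windows of a target strand, reports the complementary (antisense) oligonucleotide for each, and keeps those whose G/C fraction falls within a chosen range. The function name and the G/C limits are assumptions for the example only; the other factors noted above (mismatches, uniqueness, backbone chemistry) are not modeled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_candidates(target, length=20, gc_min=0.4, gc_max=0.6):
    """List (position, antisense_oligo, gc_fraction) for each window of target.

    target is the strand to be bound, written 5' to 3'; each oligo returned
    is its reverse complement, also written 5' to 3'. The G/C limits are
    hypothetical illustration values.
    """
    target = target.upper()
    candidates = []
    for pos in range(len(target) - length + 1):
        window = target[pos:pos + length]
        gc = (window.count("G") + window.count("C")) / length
        if gc_min <= gc <= gc_max:
            oligo = window.translate(COMPLEMENT)[::-1]
            candidates.append((pos, oligo, gc))
    return candidates


for candidate in antisense_candidates("ATGCATGC", length=4):
    print(candidate)
```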

Identifying TERT promoter subsequences bound by transcriptional regulatory factors 

The invention provides means to identify and isolate trans-acting transcriptional regulatory factors that 
are involved in modulating the activity of the TERT promoter. Identification of cis-acting motifs by sequence 
identity comparison can be a useful initial means to identify promoter sequences bound by trans-acting factors. 
The hTERT promoter contains the motif known to bind to c-Myc (the "E-box" or "Myc/Max binding site"). Two 
SP1 binding sites are located starting at residue -168 and starting at residue -134. Other identified motifs 
include the sex determining region Y gene product (SRY), hepatic nuclear factor 3-beta (HNF-3β) and hepatic 
nuclear factor 5 (HNF-5), TFIID-MBP, E2F and c-Myb cis-acting transcriptional regulatory elements. To identify 
these motifs, a variety of comparison algorithms can be used. Karas (1996) Comput. Appl. Biosci. 12:441-6; 
Frech (1997) Pac Symp Biocomput. 7:151-62; Brazma (1998) Genome Res 8:1202-1215; Tsunoda (1998) Pac 
Symp Biocomput 1998:252-63. 
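
A simple form of such a motif search can be sketched as an exact-match scan of both strands. The consensus strings used below (CACGTG for the E-box and GGGCGG for an SP1 "GC box") are commonly cited consensus sequences supplied here as assumptions; they are not recited in this disclosure, and the positions returned are 0-based indices into the supplied sequence rather than coordinates relative to a start site. Published algorithms such as those cited above use weight matrices and tolerate mismatches, which this sketch does not.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Assumed consensus sequences, for illustration only.
MOTIFS = {"E-box": "CACGTG", "SP1 GC-box": "GGGCGG"}


def scan_motifs(seq, motifs=MOTIFS):
    """Return sorted (name, strand, position) hits for exact motif matches."""
    seq = seq.upper()
    hits = []
    for name, pattern in motifs.items():
        rc = pattern.translate(COMPLEMENT)[::-1]
        # A palindromic motif reads the same on both strands; report it once.
        strands = [("+", pattern)] if rc == pattern else [("+", pattern), ("-", rc)]
        for strand, pat in strands:
            # The lookahead allows overlapping occurrences to be found.
            for match in re.finditer("(?=" + pat + ")", seq):
                hits.append((name, strand, match.start()))
    return sorted(hits, key=lambda hit: hit[2])


for hit in scan_motifs("TTCACGTGAAGGGCGGTTCCGCCC"):
    print(hit)
```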

In addition to sequence identity analysis, TERT cis-acting transcriptional regulatory elements can be 
identified by functional assays, including promoter activity assays, DNase assays, binding assays (mobility shift 
assays), and oligonucleotide affinity column chromatography. After positive or tentative identification of a cis- 
acting binding site in a TERT promoter, these sequences are used to isolate the trans-acting transcriptional 
regulatory factor(s). In a preferred embodiment, the trans-acting factors are isolated using sequence-specific 
oligonucleotide affinity chromatography, the oligonucleotides comprising TERT sequences of the invention. 

Another embodiment for identifying transcriptional regulatory motifs involves modifying putative cis- 
acting regulatory subsequences and assessing the change, if any, in the ability of the resultant TERT promoter to modulate 
transcription. The modification can be one or more residue deletions, residue substitutions, and chemical 
alterations of nucleotides. The (modified) promoter can be operably linked to TERT, a reporter gene, or any 
other transcribable sequence. The relative increase or decrease the modification has on transcriptional rates 
can be determined by measuring the ability of the unaltered TERT promoter to transcriptionally activate the 
reporter coding sequence under the same conditions as used to test the modified promoter. An increase or 
decrease in the ability of the modified TERT promoter to induce transcription as compared to the unmodified 
promoter construct identifies a cis-acting transcriptional regulatory sequence that is involved in the modulation 
of TERT promoter activity. 
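
The comparison described above reduces to a ratio of reporter signals measured under the same conditions. The following sketch, with hypothetical function names and an arbitrary two-fold cutoff chosen only for illustration, computes that ratio after background subtraction and labels the modification accordingly.

```python
def relative_activity(modified_signal, unmodified_signal, background=0.0):
    """Ratio of modified to unmodified promoter reporter signal."""
    reference = unmodified_signal - background
    if reference <= 0:
        raise ValueError("unmodified promoter signal must exceed background")
    return (modified_signal - background) / reference


def classify_modification(ratio, fold=2.0):
    """Label a ratio using an assumed fold-change cutoff (not from the text)."""
    if ratio >= fold:
        return "increase"
    if ratio <= 1.0 / fold:
        return "decrease"
    return "no clear change"


ratio = relative_activity(modified_signal=50.0, unmodified_signal=100.0)
print(ratio, classify_modification(ratio))
```

In an actual experiment, replicate measurements and normalization for transfection efficiency would be needed before any conclusion is drawn about a putative cis-acting sequence.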

The reporter gene can encode a fluorescent or phosphorescent protein, or a protein possessing 
enzymatic activity. In alternative embodiments, the detectable protein is firefly luciferase, α-glucuronidase, 
α-galactosidase, chloramphenicol acetyl transferase, green fluorescent protein, enhanced green fluorescent 
protein, and the human secreted alkaline phosphatase. Another embodiment tests the ability of these cis- 
acting elements to bind soluble polypeptide trans-acting factors isolated from different cellular compartments, 
particularly trans-acting factors expressed in nuclei. For identification and isolation of factors that stimulate 
transcription, nuclear extracts from cells that express TERT are used. 

Furthermore, once a cis-acting motif, or element, is identified, it can be used to identify and isolate 
trans-acting factors in a variety of cells and under different conditions (such as cell proliferation versus cell 
senescence). Accordingly, the invention provides a method for screening for trans-acting factors that modulate 
TERT promoter activity under a variety of conditions, developmental states, and cell types (including normal 
versus immortal versus malignant phenotypes). The cis-acting transcriptional regulatory sequences of the 
invention that modulate TERT promoter activity can also be used as oligonucleotides which, upon introduction 
into a cell, can bind trans-acting regulatory factors to modulate TERT transcription in vivo. This results in 
increased or decreased cell proliferative capacity for the treatment of various diseases and conditions. 

High throughput screening of small molecule modulators of TERT transcription 

The invention provides constructs and methods for screening modulators, in a preferred embodiment, 
small molecule modulators, of TERT promoter activity in vitro and in vivo. The invention incorporates all assays 
available to screen for small molecule modulators of TERT transcription. In a preferred embodiment, high 
throughput assays are adapted and used with the novel TERT promoter sequences and constructs provided by 
the invention. Schultz (1998) Bioorg Med Chem Lett 8:2409-2414; Weller (1997) Mol Divers. 3:61-70; 
Fernandes (1998) Curr Opin Chem Biol 2:597-603; Sittampalam (1997) Curr Opin Chem Biol 1:384-91. 

In alternative embodiments, recombinant constructs contain hTERT promoter sequences driving a 
marker, such as an alkaline phosphatase marker gene (SEAP) or a β-galactosidase gene. Using a SEAP 
expressing construct of the invention, it was demonstrated that a TERT promoter fragment of approximately 2.5 
kb is sufficient to activate and repress TERT transcription in response to proliferation and/or growth arrest 
stimuli in a model cell line, IDH4. Two cell clones, ID245-1 and ID245-16, whose SEAP profiles closely matched 
telomerase activity after TERT up-regulation by dexamethasone were selected and expanded for high 
throughput screening of small molecule activators of telomerase. 

Treatment of diseases associated with altered telomerase expression 

The present invention provides TERT promoter sequences useful for the treatment of diseases and 
disease conditions. The recombinant and synthetic nucleic acids comprising TERT promoter, or TERT 
antisense complementary sequences, can be used to create or elevate telomerase activity in a cell, as well as 
to inhibit telomerase activity in cells in which it is not desired. In a preferred embodiment, human TERT 
promoter sequences or antisense sequences are used for the treatment of human diseases and disease 
conditions. 

Identification of cis-acting transcriptional regulatory sequences by the invention further provides for the 
design of targeted sequences that, as oligonucleotides, can modify TERT promoter activity. In one 
embodiment, telomerase activity is created or elevated by binding significant amounts of a trans-acting 
transcriptional repressor or down-regulator with a nucleic acid that binds specifically to the repressor. In 
another embodiment, telomerase activity is down-regulated by antisense oligonucleotides binding to promoter 
sequences. Similarly, telomerase activity can be inhibited by binding significant amounts of a trans-acting 
transcriptional activator or up-regulator with a nucleic acid that binds specifically to the activator; or telomerase 
activity is up-regulated by antisense oligonucleotides binding to promoter sequences involved in telomerase 
repression. Thus, inhibiting, activating or otherwise altering a telomerase activity (telomerase catalytic activity, 
fidelity, processivity, telomere binding, etc.) in a cell can be used to change the proliferative capacity of the cell. 

For example, reduction of telomerase activity in an immortal cell, such as a malignant tumor cell, can 
render the cell mortal. Conversely, increasing the telomerase activity in a cell line or a mortal cell (most human 
somatic cells) can increase the proliferative capacity of the cell. For example, expression of hTERT protein in 
dermal fibroblasts, thereby increasing telomere length, will result in increased fibroblast proliferative capacity. 
Such expression can slow or reverse age-related degenerative processes, such as the age-dependent slowing 
of wound closure (West (1994) Arch. Derm. 130:87). Thus, in one aspect, the present invention provides 
reagents and methods useful for treating diseases and conditions characterized by the presence, absence, or 
altered amount of human telomerase activity in a cell (where the diseases and conditions are susceptible to 
treatment using the compositions and methods disclosed herein). These diseases include, e.g., cancers, other 
diseases of cell proliferation (particularly, degenerative and aging processes and diseases of aging), 
immunological disorders, and infertility (or fertility). 

TERT promoter operably linked to cellular toxins 

In one embodiment, the TERT promoter of the invention is operably linked to a transcribable sequence 
that encodes a cellular toxin. Polypeptide toxins that can be recombinantly generated include ricin, abrin 
(Hughes (1996) Hum. Exp. Toxicol. 15:443-451), diphtheria, gelonin (Rosenblum (1996) Cancer Immunol. 
Immunother. 42:115-121), Pseudomonas exotoxin A, tumor necrosis factor alpha (TNF-α), Crotalus durissus 
terrificus toxin, Crotalus adamanteus toxin, Naja naja toxin, and Naja mocambique toxin. Rodriguez (1998) 
Prostate 34:259-269; Mauceri (1996) Cancer Res. 56:4311-4314. The cellular toxin can also be capable of 
inducing apoptosis, such as the ICE-family of cysteine proteases, the Bcl-2 family of proteins, bax, bclXs and 
caspases. Favrot (1998) Gene Ther. 5:728-739; McGill (1997) Front. Biosci. 2:D353-D379; McDonnell (1995) 
Semin. Cancer Biol. 6:53-60. 

Alternatively, the sequence under the control of the TERT promoter can code for polypeptides having 
activity that is not itself toxic to a cell, but which renders the cell sensitive to an otherwise nontoxic drug, such 
as Herpes virus thymidine kinase (HSV-TK). The HSV-TK is innocuous but converts the anti-herpetic agent 
ganciclovir (GCV) to a toxic product that interferes with DNA replication in proliferating cells. Delaney (1996) J. 
Neurosci. 16:6908-6918; Heyman (1989) Proc. Natl. Acad. Sci. USA 86:2698-2702. The art describes 
numerous other suitable toxic or potentially toxic proteins and systems that may be applied in this embodiment. 

The methods of the invention, in addition to enabling the specific killing of telomerase-positive cells, 
can also be used to prevent transformation of telomerase-negative cells to a telomerase-positive state. As 
shown in the examples below, an hTERT promoter sequence can be operably linked to a reporter gene such 
that activation of the promoter results in expression of the protein encoded by the reporter gene. If, instead of a 
reporter protein, the encoded protein is toxic to the cell, activation of the promoter leads to cell morbidity or 
death. 



Oncolytic viruses and toxins for treating cancer 

The present invention provides methods and compositions for reducing TERT promoter activity (and 
hence telomerase activity) in immortal cells and tumor cells for treating cancer. Cancer cells (malignant tumor 
cells) that express telomerase activity (telomerase-positive cells) can be mortalized by decreasing or inhibiting 
TERT promoter activity. Moreover, because measurable telomerase activity levels correlate with disease 
characteristics such as metastatic potential (U.S. Patent Nos. 5,639,613; 5,648,215; 5,489,508; and Pandita 
(1996) Proc. Am. Ass. Cancer Res. 37:559), any reduction in TERT promoter activity could reduce the 
aggressive nature of a cancer to a more manageable disease state. 

Taking advantage of this characteristic, in one embodiment of the invention, a TERT promoter 
sequence is operably linked to a gene encoding a toxin (such as ricin, diphtheria, gelonin, Pseudomonas 
toxin, or abrin) and introduced into a cell to kill the cell. If or when TERT transcriptional activators are expressed or 
activated in the cell, the toxin will be expressed, resulting in specific cell killing. 

Alternatively, the TERT promoter-linked gene can encode a protein having activity that is not itself 
toxic to a cell, but which renders the cell sensitive to an otherwise nontoxic drug (such as Herpes virus 
thymidine kinase). 

In another embodiment, the invention takes advantage of the fact that normal cytopathic viruses, in 
particular human cytopathic viruses, such as adenovirus or Herpes virus, require essential virally encoded 
genes to proliferate, thereby lysing specific cells. Based on the description that follows, those skilled in the art 
will recognize that a number of different cytopathic viruses can be adapted according to this invention. 
Cytopathic viruses are well known in the art, and are described inter alia in publications by Coffey, Toda, 
Chase, and Kramm, infra. Genes essential for replication have been characterized in many such viruses. If an 
essential replication gene of any of these viruses is driven by the TERT promoter, proliferation of the virus and 
its cytopathic effects would be restricted to tumor cells and other telomerase-expressing cells. For example, 
some essential genetic elements for replication of adenovirus are the E4, E1a, E1b, and E2 regions, or any of 
the late gene products. Essential genetic elements for replication of HSV-1 include ICP6 and ICP4. 

Accordingly, the invention provides constructs and methods for killing telomerase-positive cells (such 
as cancer cells) wherein TERT promoter sequences of the invention are operably linked to such essential 
replication genetic elements. For use in human cells, human cytopathic viruses modified with hTERT promoter 
sequences are preferred. Any one or more of the genes required for the replication and packaging of the virus 
could be modified to be driven by the TERT promoter. For instance, in one embodiment, expression of the E1a 
gene of adenovirus, which is required for the activation of expression of a cascade of adenoviral genes, is 
placed under the control of the hTERT promoter. 

Thus, expression of E1a, and hence downstream replication of the virus, occurs only in those cells that 
express telomerase (such as tumor cells). Likewise, a recombinant adenovirus of the invention is designed so 
the adenoviral capsid genes are under the control of a TERT promoter. While this construct replicates its DNA 
in most cell types, it packages itself into active, infectious (and cytotoxic) virus only in those cells that express 
telomerase. Thus, when these constructs are used as cancer therapeutics, the conditionally replicative virus 
yields a productive infection only in tumor cells (with no effect in "normal" cells that do not express 
telomerase). Infection of normal cells that do not express telomerase is expected to produce either no virus or 
abortive production of the virus, depending on which gene is driven by the TERT promoter. Thus, these 
recombinant viruses of the invention allow the natural, yet tumor specific, amplification of an oncolytic virus. 

In alternative embodiments, many other elements are incorporated into a TERT promoter restricted 
oncolytic virus or a TERT promoter restricted replicative virus that is not lytic. Suicide genes, 
marker genes, apoptotic genes or genes encoding cell cycle regulators are incorporated in the TERT promoter restricted 
conditionally replicative recombinant virus. Expression of these elements in such a virus would assist the arrest 
of tumor growth. In one embodiment, elements to be included within these conditionally replicative viruses of 
the invention are structures that inhibit telomerase activity. These telomerase inhibitors could incorporate 
inhibitory oligonucleotides, dominant-negative inhibitors of TERT, or the gene for any agent that would disrupt 
or prevent TR/TERT assembly, interactions, or activity. 

Other elements can also be included in the TERT promoter restricted vectors of the invention. For 
example, small inhibitory RNA molecules, preferably targeting cancer cells, such as RNA targeting telomerase 
activity can be synthesized in vivo using a recombinant adenovirus vector. Exemplary sequences are provided 
in US Patent No. 5,858,777 and GB 20890.4. RNA production from the adenovirus can be achieved by a 
variety of expression cassettes. For cell growth inhibition purposes, RNA polymerase III expression cassettes 
based on the structure of tRNA genes and other RNA polymerase III transcripts, including the U6 snRNA gene, 
as well as RNA polymerase II snRNP (U1, U2) transcripts are preferred due to their ability to produce high 
levels of transcripts. 

The hTERT promoter restricted viruses of the invention can be designed to express inhibitory RNAs, 
such as antisense molecules complementary to several regions of the hTR molecule, including the template region. 
The inhibitory RNAs can also mimic sequences and/or structures present in the RNA component of telomerase 
(e.g., hTR), including potential binding site(s) for TERT or other telomerase-associated proteins that might 
interact with the RNA component. Other elements can also be designed to generate inhibitory RNAs to target 
TERT mRNA by preventing its normal processing, folding, modification, transport and/or translation. 

Other cytopathic viral vectors of the invention can be designed to generate RNA molecules with 
sequences necessary for cytoplasmic export and translation into peptides. The resulting polypeptides or 
peptides can be designed to target telomerase components or other molecules that are associated with 
telomerase thereby influencing telomerase catalytic activity. The peptides that inhibit telomerase will be 
produced at high level, paralleling the amount of RNA. For example, peptides could be designed to mimic the 
stretch of amino acids in hTERT involved in its binding to hTR, thereby acting as competitors in the assembly of 
a functional telomerase. 

The TERT promoter restricted viral vectors of the invention can also be designed to generate peptides 
or polypeptides for any domain of TERT involved in interactions with other proteins and disrupt contacts that 
are essential for telomerase function. Other TERT promoter restricted viruses of the invention can be designed 
to generate polypeptides to bind to telomere complexes and prevent access and/or docking of telomerase or to 
generate immunogenic peptides, in part TERT peptides. 

Other TERT promoter restricted viral vectors of the invention can be designed to generate 
polypeptides to mimic a variety of apoptosis inducing agents observed during programmed cell death and could 
result in the onset of apoptosis. TERT promoter restricted viruses do not necessarily need to be cytopathic. 
The TERT promoter conditionally restricted virus could be used to amplify any sequences or any element in any 
TERT expressing cell, such as a tumor cell. 

Any of these embodiments can be provided with the conditionally replicative viruses of the invention. 
The TERT promoter constructs of the invention can also be used in gene therapy vectors to prevent telomerase 
activation and result in specific mortalization or death of telomerase-positive cells. Similarly, these gene 
therapy methods may be used for treating a genetic predilection for cancers. 

Treatment of other conditions 

The present invention also provides compositions and methods useful for treatment of diseases and 
disease conditions (in addition to cancers) characterized by under- or over-expression of telomerase or TERT 
gene products. Examples include diseases of cell proliferation, diseases resulting from cell senescence 
(particularly processes and diseases of aging), immunological disorders, infertility, and diseases of immune 
dysfunction. Certain diseases of aging are characterized by cell senescence-associated changes due to 
reduced telomere length (compared to younger cells), resulting from the absence (or much lower levels) of 
telomerase activity in the cell. Decreased telomere length and decreased replicative capacity contribute to 
these diseases. Telomerase activity (resulting in increased telomere length) can be up-regulated by increasing 
TERT promoter activity in the cell. 

The present invention, by providing methods and compositions for modulating TERT promoter activity, 
also provides methods to treat infertility. Human germline cells (spermatogonia cells, their progenitors or 
descendants) are capable of indefinite proliferation and characterized by high telomerase activity. Abnormal or 
diminished levels of TERT gene products can result in inadequate or abnormal production of spermatozoa, 
leading to infertility or disorders of reproduction. Accordingly, infertility associated with altered telomerase 
activity can be treated using the methods and compositions described herein to increase TERT promoter 
activity levels. Similarly, because inhibition of telomerase may negatively impact spermatogenesis, oogenesis, 
and sperm and egg viability, the compositions of the invention capable of inhibiting hTERT promoter activity can 
have contraceptive effects when used to reduce hTERT levels in germline cells. 






WO 00/46355 PCT/US00/03104 

In a further embodiment, the invention provides methods and compositions useful for decreasing the 
proliferative potential of telomerase-positive cells such as activated lymphocytes and hematopoietic stem cells 
by reducing TERT promoter activity. Thus, the invention provides means for effecting immunosuppression. 
Conversely, the methods and reagents of the invention are useful in immunostimulation by increasing TERT 
promoter activity (resulting in increased proliferative potential) in immune cells, including hematopoietic stem 
cells (that express a low level of telomerase or no telomerase prior to therapeutic intervention). 

Modulating TERT promoter activity 

As is clear from the foregoing discussion, modulation of the level of TERT promoter transcriptional 
activity (and thus, the levels of telomerase or telomerase activity of a cell) can have a profound effect on the 
proliferative potential of the cell, and so has great utility in treatment of disease. This modulation can either be 
a decrease or an increase in TERT promoter activity. The promoter activity-modulatory nucleic acid molecules 
of the invention can act through a number of mechanisms. However, the invention is not limited to any 
particular mechanism of action. 

For example, TERT promoter activity may be decreased or increased by single stranded antisense 
sequences that directly bind to TERT promoter sequences. This will result in decrease in affinity or inhibition of 
trans-acting transcriptional regulatory factors binding to critical TERT promoter sequences (TATA boxes, CAAT 
boxes, and the like). When the cis-acting element bound by a trans-acting factor has inhibitory activity, the 
binding of the oligonucleotide would result in up-regulation of TERT transcription. Conversely, if the promoter 
subsequence, when bound by a trans-acting factor, has up-regulating activity, the binding of the oligonucleotide 
would result in down-regulation of TERT transcription. In another embodiment, double-stranded 
oligonucleotides representing TERT promoter subsequences directly bind trans-acting transcriptional 
modulatory elements, thus preventing them from binding their corresponding cis-acting elements. In summary, 
TERT promoter activity may be increased or decreased through any of several mechanisms, or a combination 
of mechanisms. These include any means apparent to those of skill upon review of this disclosure. 
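
The direction of the effect described above can be summarized in a short illustrative sketch. It assumes a simplified model in which each trans-acting factor is purely an activator or purely a repressor; the names `Role`, `Mechanism` and `predicted_effect` are hypothetical and are introduced only for illustration. Under that assumption, both an antisense oligonucleotide that masks the cis-acting element and a double-stranded decoy that sequesters the factor oppose the normal role of the factor.

```python
from enum import Enum


class Role(Enum):
    """Normal role of a trans-acting factor at its cis-acting element (simplified)."""
    ACTIVATOR = "activator"
    REPRESSOR = "repressor"


class Mechanism(Enum):
    """How the oligonucleotide interferes (illustrative labels only)."""
    ANTISENSE_MASKS_CIS_ELEMENT = "antisense"
    DECOY_SEQUESTERS_FACTOR = "decoy"


def predicted_effect(role: Role, mechanism: Mechanism) -> str:
    """Return the expected direction of change in TERT transcription.

    In this simplified model either mechanism prevents the factor from acting
    at the promoter, so the outcome depends only on the role of the factor.
    """
    if role is Role.REPRESSOR:
        return "up-regulation"
    return "down-regulation"


if __name__ == "__main__":
    for role in Role:
        for mechanism in Mechanism:
            print(role.value, mechanism.value, "->", predicted_effect(role, mechanism))
```

The sketch ignores factors that act as both activator and repressor depending on context, and it is not a quantitative model of promoter activity.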

The cis-acting transcriptional regulatory sequences of the invention can also be used as 
oligonucleotides which, upon introduction into a cell, can bind trans-acting regulatory factors to modulate TERT 
transcription in vivo. These oligonucleotides can be delivered to target cells through an appropriate delivery 
scheme or they can be synthesized in vivo by recombinant expression systems (vectors, viruses, and the like). 

Oligonucleotides and other pharmaceutical compositions 

Antisense oligonucleotides which hybridize to TERT promoter sequences will inhibit the binding of 
trans-acting transcriptional regulatory agents to critical TERT promoter sequences. Furthermore, the result will 
be activation or repression of TERT transcriptional activity, depending on whether the promoter subsequence is 
down-regulatory or up-regulatory, respectively. Thus, the invention provides antisense oligonucleotides 
directed to the TERT promoter (cis-acting) binding sites for c-Myc (the "E-box" or "Myc/Max binding sites"), 
SP1, Y gene product (SRY), HNF-3β, HNF-5, TFIID-MBP, E2F, c-Myb, TATA boxes, CAAT boxes, and other 
regulatory elements. 
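
As an illustration of how candidate antisense oligonucleotides directed to such a binding site might be enumerated, the following minimal sketch scans a sequence for the canonical E-box hexamer CACGTG and returns the reverse complement of each site together with its flanking bases. The function names, the flank length, and the toy sequence are assumptions made for this example; the toy sequence is not a TERT promoter sequence of the invention.

```python
import re

# Watson-Crick complements for unmodified DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_oligos(promoter: str, motif: str = "CACGTG", flank: int = 7):
    """List (position, target, antisense) for each occurrence of `motif`.

    `target` is the motif plus up to `flank` bases on each side; `antisense`
    is its reverse complement. A lookahead is used so overlapping sites are
    also reported.
    """
    promoter = promoter.upper()
    results = []
    for match in re.finditer(f"(?={re.escape(motif)})", promoter):
        start = max(0, match.start() - flank)
        end = min(len(promoter), match.start() + len(motif) + flank)
        target = promoter[start:end]
        results.append((match.start(), target, reverse_complement(target)))
    return results


if __name__ == "__main__":
    toy_sequence = "ATGCCGTTAGCACGTGACCTGATTGC"  # made-up sequence for illustration
    for position, target, antisense in antisense_oligos(toy_sequence, flank=4):
        print(position, target, antisense)
```

Other binding sites could be scanned by passing a different `motif`; degenerate consensus sequences would need a pattern rather than a literal string, which this sketch does not attempt.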

TERT polynucleotides can be produced by direct chemical synthesis. Chemical synthesis will typically 
be used to produce oligonucleotides and polynucleotides containing nonstandard nucleotides (probes, primers 
and antisense oligonucleotides) although nucleic acids containing only standard nucleotides can also be 
prepared. Direct chemical synthesis of nucleic acids can be accomplished for example by the phosphotriester 
method of Narang (1979) Meth. Enzymol. 68:90; the phosphodiester method of Brown (1979) Meth. Enzymol. 
68:109; the diethyl-phosphoramidite method of Beaucage (1981) Tetra. Lett. 22:1859; and the solid support 
method of U.S. Patent No. 4,458,066. Chemical synthesis typically produces a single stranded oligonucleotide, 
which may be converted into double stranded DNA by hybridization with a complementary sequence, or by 
polymerization with a DNA polymerase and an oligonucleotide primer using the single strand as a template. 
One of skill will recognize that while chemical synthesis of DNA is often limited to sequences of less than about 
100 or 150 bases, longer sequences may be obtained by the ligation of shorter sequences or by more 
elaborate synthetic methods. It will be appreciated that the polynucleotides and oligonucleotides of the 
invention can be made using nonstandard bases (other than adenine, cytidine, guanine, thymine, and uridine) 
or nonstandard backbone structures to provide desirable properties (increased nuclease-resistance, tighter 
binding, stability or a desired Tm). Techniques for rendering oligonucleotides nuclease-resistant include those 
described in PCT publication WO 94/12633. A wide variety of useful modified oligonucleotides may be 
produced, including oligonucleotides having a peptide nucleic acid (PNA) backbone (Nielsen (1991) Science 
254:1497) or incorporating 2'-O-methyl ribonucleotides, phosphorothioate nucleotides, methyl phosphonate 
nucleotides, phosphotriester nucleotides, and phosphoramidates. Still other 
useful oligonucleotides may contain alkyl and halogen-substituted sugar moieties comprising one of the 
following at the 2' position: OH, SH, SCH3, F, OCN, OCH3OCH3, OCH3O(CH2)nCH3, O(CH2)nNH2 or 
O(CH2)nCH3 where n is from 1 to about 10; C1 to C10 lower alkyl, substituted lower alkyl, alkaryl or aralkyl; Cl; 
Br; CN; CF3; OCF3; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; SOCH3; SO2CH3; ONO2; NO2; N3; NH2; 
heterocycloalkyl; heterocycloalkaryl; amino-alkylamino; polyalkylamino; substituted silyl; an RNA cleaving 
group; a cholesteryl group; a folate group; a reporter group; an intercalator; a group for improving the 
pharmacokinetic properties of an oligonucleotide; or a group for improving the pharmacodynamic properties of 
an oligonucleotide and other substituents having similar properties. Folate, cholesterol or other groups which 
facilitate oligonucleotide uptake, such as lipid analogs, may be conjugated directly or via a linker at the 2' 
position of any nucleoside or at the 3' or 5' position of the 3'-terminal or 5'-terminal nucleoside, respectively. 
One or more such conjugates may be used. Oligonucleotides may also have sugar mimetics such as 
cyclobutyls in place of the pentofuranosyl group. Other embodiments may include at least one modified base 
form or "universal base" such as inosine, or inclusion of other nonstandard bases such as queuosine and 
wybutosine as well as acetyl-, methyl-, thio- and similarly modified forms of adenine, cytidine, guanine, thymine, 
and uridine which are not as easily recognized by endogenous endonucleases. The invention further provides 
oligonucleotides having backbone analogues such as phosphodiester, phosphorothioate, phosphorodithioate, 
methylphosphonate, phosphoramidate, alkyl phosphotriester, sulfamate, 3'-thioacetal, methylene(methylimino), 
3'-N-carbamate, morpholino carbamate, chiral-methyl phosphonates, nucleotides with short chain alkyl or 
cycloalkyl intersugar linkages, short chain heteroatomic or heterocyclic intersugar ("backbone") linkages, or 
CH2-NH-O-CH2, CH2-N(CH3)-O-CH2, CH2-O-N(CH3)-CH2, CH2-N(CH3)-N(CH3)-CH2 and O-N(CH3)-CH2-CH2 
backbones (where phosphodiester is O-P-O-CH2), or mixtures of the same. Also useful are oligonucleotides 
having morpholino backbone structures (U.S. Patent No. 5,034,506). 
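
Where a desired Tm is one of the design criteria, a first rough estimate for a short, unmodified DNA oligonucleotide can be obtained with the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius. The sketch below is offered only as an illustration: the function name is hypothetical, the rule is a rule of thumb generally applied to oligonucleotides of roughly 14 to 20 bases, and it does not account for the nonstandard bases or backbone modifications described above, which are expected to shift the actual Tm.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate Tm (degrees C) of a short unmodified DNA oligo by the Wallace rule.

    Tm = 2 * (number of A and T) + 4 * (number of G and C).
    Raises ValueError if the sequence contains anything other than A, C, G, T.
    """
    seq = oligo.upper()
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    if at_count + gc_count != len(seq):
        raise ValueError("sequence must contain only A, C, G and T")
    return 2 * at_count + 4 * gc_count


if __name__ == "__main__":
    # Made-up 14-mer used purely as an example input.
    print(wallace_tm("AGGTCACGTGCTAA"))
```

Nearest-neighbor thermodynamic models give better estimates for longer or modified oligonucleotides; experimental determination remains the reference.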

While the invention is not limited by any particular mechanism, oligonucleotides of the invention can 
also bind to double-stranded or duplex TERT promoter sequences. They can bind in a folded region, forming a 
triple helix, or "triplex" nucleic acid. Triple helix formation results in inhibition of TERT promoter activity by 
disrupting the secondary structure of the promoter sequence, resulting in a new conformation which the trans- 
acting factor cannot bind with sufficient affinity to have a transcriptional-modifying effect. Alternatively, triple 
helix formation (induced by the binding of the antisense oligonucleotide of the invention) compromises the 
ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory 
trans-acting molecules to occur. Triplex oligonucleotide and polynucleotide construction is described in Cheng 
(1988) J. Biol. Chem. 263:15110; Ferrin (1991) Science 254:1494; Ramdas (1989) J. Biol. Chem. 264:17395; 
Strobel (1991) Science 254:1639; Rigas (1986) Proc. Natl. Acad. Sci. U.S.A. 83:9591; Carr, 1994, Molecular 
and Immunological Approaches, Futura Publishing Co, Mt Kisco NY; Rininsland (1997) Proc. Natl. Acad. Sci. 
USA 94:5854; Perkins (1998) Biochemistry 37:11315-11322. 

The therapeutic nucleic acids and methods of the invention involve the administration of 
oligonucleotides or polynucleotides that function to inhibit or stimulate TERT promoter activity under in vivo 
physiological conditions. In one embodiment, these nucleic acids are single stranded antisense sequences 
capable of binding to promoter sequences. In an alternative embodiment, they are double stranded nucleic 
acids capable of binding trans-acting transcriptional regulatory factors. They should be sufficiently stable under 
physiological conditions for a period of time to obtain a therapeutic effect. Modified nucleic acids can be useful 
in imparting such stability, as well as for targeting delivery of the oligonucleotide to the desired tissue, organ, or 
cell. Oligo- and poly-nucleotides can be delivered directly as a drug in a suitable pharmaceutical formulation, or 
indirectly by means of introducing a nucleic acid expression system that can recombinantly generate the 
hTERT promoter modulating oligonucleotides into a cell. In one embodiment, oligonucleotides directly bind to 
cis-acting sequences or, alternatively, bind to trans-acting regulatory factors. One embodiment exploits the fact 
that the TERT promoter is only relatively active in a very limited range of cell types, including, significantly, 
cancer cells. 

Oligonucleotides or expression vectors can be administered by liposomes, immunoliposomes, 
ballistics, direct uptake into cells, and the like. For treatment of disease the oligonucleotides of the invention 
are administered to a patient in a therapeutically effective amount, which is an amount sufficient to ameliorate 
the symptoms of the disease or modulate hTERT promoter activity (thereby affecting telomerase activity) in the 
target cell. Methods useful for delivery of oligonucleotides for therapeutic purposes are described in U.S. 
Patent 5,272,065. Telomerase activity can be measured by TRAP assay or other suitable assay of telomerase 
biological function, as discussed in detail in other publications. 

The invention provides pharmaceutical compositions that comprise TERT promoter-containing nucleic 
acids (polynucleotides, expression vectors, gene therapy constructs) alone or in combination with at least one 
other agent, such as a stabilizing compound, diluent, carrier, cell targeting agent, or another active ingredient or 
agent. The therapeutic agents of the invention may be administered in any sterile, biocompatible 
pharmaceutical carrier, including, but not limited to, saline, buffered saline, dextrose, and water. Any of these 
molecules can be administered to a patient alone, or in combination with other agents, drugs or hormones, in 
pharmaceutical compositions where it is mixed with suitable excipients, adjuvants, and/or pharmaceutically 
acceptable carriers. 

The pharmaceutical compositions of the invention can be administered by any means. Methods of 
parenteral delivery include topical, intra-arterial, intramuscular (IM), subcutaneous (SC), intramedullary, 
intrathecal, intraventricular, intravenous (IV), intraperitoneal (IP), or intranasal administration. Further details on 
techniques for formulation and administration may be found in the latest edition of Remington's Pharmaceutical 
Sciences (Mack Publishing Co, Easton PA); PCT publication WO 93/23572. 

Pharmaceutical compositions of the invention include TERT-containing nucleic acids in an effective 
amount to achieve the intended purpose. "Therapeutically effective amount" or "pharmacologically effective 
amount" are well recognized phrases and refer to that amount of an agent effective to produce the intended 
pharmacological result. For example, a therapeutically effective amount is an amount sufficient to treat a 
disease or condition or ameliorate the symptoms of the disease being treated. Useful assays to ascertain an 
effective amount for a given application include measuring the effect on endogenous TERT promoter activity 
and telomerase activity in a target cell. The amount actually administered will be dependent upon the individual 
to which treatment is to be applied, and will preferably be an optimized amount such that the desired effect is 
achieved without significant side effects. The therapeutically effective dose can be estimated initially either in 
cell culture assays or in any appropriate animal model. The animal model is also used to estimate appropriate 
dosage ranges and routes of administration in humans. Thus, the determination of a therapeutically effective 
dose is well within the capability of those skilled in the art. 

Cell lines and animals with modified promoter sequences 

Most vertebrate cells senesce after a finite number of divisions in culture (~50 to 100 divisions). 
Certain variant cells, however, are able to divide indefinitely in culture (e.g., HeLa cells, 293 cells) and, for this 
reason, are useful for research and industrial applications. Usually these immortal cell lines are derived from 
spontaneously arising tumors, or by transformation by exposure to an oncogene, radiation or a tumor-inducing 
virus or chemical. Unfortunately, a limited selection of cell lines, especially human cell lines representing 
differentiated cell function, is available. Moreover, many immortal cell lines presently available are 
characterized by chromosomal abnormalities (aneuploidy, gene rearrangements, or mutations). Further, many 
long-established cell lines are relatively undifferentiated. Thus, there is a need for the TERT promoter 
activating compositions and methods of the invention to generate new immortal cell lines, especially using cells 
of human origin, where hTERT promoter activating compositions and methods are preferred. 

The "immortalized cells" of the invention are not limited to those that proliferate indefinitely, but also 
include cells with increased proliferative capacity compared to similar cells whose TERT promoter has not been 
up-regulated. Depending on the cell type, increased proliferative capacity may mean proliferation for at least 
about 50, about 100, about 150, about 200, or about 400 or more generations, or for at least about 3, about 6, 
about 12, about 18, about 24 or about 36 or more months in culture. 

Uses for cells with increased proliferative capacity include the production of natural proteins and 
recombinant proteins (therapeutic polypeptides such as erythropoietin, human growth hormone, insulin, and the 
like), or antibodies, for which a stable, genetically normal cell line is preferred. Another use is for replacement 
of diseased or damaged cells or tissue. For example, autologous immune cells immortalized using a TERT 
promoter sequence of the invention can be used for cell replacement in a patient after aggressive cancer 
therapy, such as whole body irradiation. Another use for immortalized cells is for ex vivo production of 
"artificial" tissues or organs for therapeutic use. Another use for such cells is for screening or validation of 
drugs, such as telomerase-inhibiting drugs, or for use in production of vaccines or biological reagents. 
Additional uses of the cells of the invention will be apparent to those of skill. 

The invention also provides non-human transgenic animals comprising heterologous TERT or 
recombinant constructs comprising endogenous TERT promoter. In a preferred embodiment, the transgenic 
animals of the invention comprise a TERT promoter driving a heterologous gene, such as a reporter gene 
coding sequence. In a preferred embodiment, an hTERT promoter of the invention is operably linked to a 
reporter gene in a transgenic mouse. Alternatively, an mTERT promoter is operably linked to a reporter gene in 
a transgenic mouse. These transgenic animals are very useful as in vivo animal models to screen for 
modulators of TERT transcriptional activity. The introduction of hTERT, mTERT or other TERT promoters into 
animals to generate transgenic models is also used to assess the consequences of mutations or deletions to 
the transcriptional regulatory regions. 

In one embodiment, the endogenous TERT gene in these mice is still functional and wild-type (native) 
telomerase activity can still exist. When a TERT promoter of the invention is used to drive high-level expression of 
an exogenous TERT construct, the endogenously produced mTERT protein can be competitively replaced with 
the introduced, exogenous TERT protein. This transgenic animal (retaining a functional endogenous 
telomerase activity) is preferred in situations where it is desirable to retain "normal," endogenous telomerase 
function and telomere structure. In other situations, where it is desirable that all telomerase activity is by the 
introduced exogenous TERT protein, an mTERT knockout line can be used. 

Promoter function, and in a preferred embodiment, hTERT promoter function, can be assessed with 
these transgenic animals. Alterations of TERT promoters can be constructed that drive TERT or a reporter 
gene to assess their function and expression pattern and characteristics (the invention also provides constructs 
and animals and methods for gene expression driven by a TERT promoter by transient transfection). 

In one embodiment, the TERT promoters and reagents of the invention are used to create mouse cells 
and transgenic animals in which the endogenous TERT promoter is deleted, modified, supplemented or 
inhibited. For example, TERT promoter sequences can be deleted, modified or inhibited on either one or both 
alleles. The cells or animals can be reconstituted with a wild-type or modified TERT promoter, or, in a preferred 
embodiment, an exogenous TERT in the form of hTERT. 

Construction of a "knockout" cell and animal is based on the premise that the level of expression of a 
particular gene in a mammalian cell can be decreased or completely abrogated by introducing into the genome
a new DNA sequence that serves to interrupt some portion of the DNA sequence of the gene/promoter to be
suppressed. To prevent expression from the endogenous promoter, simple mutations that alter or disrupt the
promoter can be suitable. To up-regulate expression, a native TERT promoter can be substituted with a 
heterologous or mutated TERT promoter that induces higher levels of transcription, or with multiple copies of 
transgene TERT promoters. Also, "gene trap insertion" can be used to disrupt a host gene, and mouse 
embryonic stem (ES) cells can be used to produce knockout transgenic animals, as described herein and in 
Holzschu (1997) Transgenic Res 6: 97-106. 

Vectors specifically designed for integration by homologous recombination comprising TERT promoter 
sequences are also provided by the invention. Important factors for optimizing homologous recombination 
include the degree of sequence identity and length of homology to chromosomal sequences. The specific 
sequence mediating homologous recombination is also important, because integration occurs much more 
easily in transcriptionally active DNA. Methods and materials for constructing homologous targeting constructs 
are described by Mansour (1988) Nature 336: 348; Bradley (1992) Biotechnology 10:534; U.S. Patent Nos. 
5,627,059; 5,487,992; 5,631,153; and 5,464,764. 

In a preferred embodiment, cell and transgenic animal models express TERT promoter (particularly, 
hTERT promoter) operably linked to a reporter gene. The cell or animal can be a TERT promoter "knockout" or 
it can retain endogenous TERT promoter activity. The insertion of the TERT promoter-containing exogenous 
sequence is typically by homologous recombination between complementary nucleic acid sequences. Thus, 
the exogenous sequence, which is typically an hTERT or mTERT promoter of this invention, is some portion of 
the target gene to be modified, such as exon, intron or transcriptional regulatory sequences, or any genomic 
sequence which is able to affect the level of the target gene's expression; or a combination thereof. The 
construct can also be introduced into other locations in the genome. Gene targeting via homologous 
recombination in pluripotential embryonic stem cells allows one to modify precisely the genomic sequence of 
interest. 

In another embodiment, the introduced TERT promoter sequence (modified or wild type) can replace 
or disrupt an endogenous TERT promoter sequence. A newly introduced TERT promoter sequence can be 
engineered to have greater or lesser transcriptional activity, be responsive to new trans-acting transcriptional 
modulating agents, and the like. 

Disruption of an endogenous TERT promoter sequence typically will decrease or abrogate
("knockout") the transcription of TERT. In one embodiment, the TERT promoter "knockout" is prepared by
deletion or disruption by homologous recombination of the endogenous hTERT promoter. Homologous
recombination and other means to alter (and "knockout") expression of endogenous sequences are described in
Moynahan (1996) Hum. Mol. Genet. 5:875; Baudin (1993) Nucl. Acids Res. 21:3329; Wach (1994) Yeast
10:1793; Rothstein (1991) Methods Enzymol. 194:281; Anderson (1995) Methods Cell Biol. 48:31; Pettitt
(1996) Development 122:4149-4157; Ramirez-Solis (1993) Methods Enzymol. 225:855; Thomas (1987) Cell
51:503; Couldrey (1998) Dev. Dyn. 212:284-292; Holzschu (1997) Transgenic Res. 6:97-106; U.S. Patents
5,464,764; 5,631,153; 5,487,992; 5,627,059; and 5,272,071; WO 91/09955; WO 93/09222; WO 96/29411;
WO 95/31560; WO 91/12650. Vectors useful in TERT gene therapy can be viral or nonviral. They may comprise
other regulatory or processing sequences. Lyddiatt (1998) Curr. Opin. Biotechnol. 9:177-85.

The invention provides for delivery of the expression systems into cells or tissues in vitro or ex vivo.
For ex vivo therapy, vectors may be introduced into cells taken from the patient and clonally propagated for
autologous transplant back into the same patient (U.S. Patent Nos. 5,399,493 and 5,437,994). Cells that can be
targeted for TERT promoter gene therapy aimed at increasing the telomerase activity of a target cell include,
but are not limited to, embryonic stem or germ cells, particularly primate or human cells, hematopoietic stem
cells (AIDS and post-chemotherapy), vascular endothelial cells (cardiac and cerebral vascular disease), skin
fibroblasts and basal skin keratinocytes (wound healing and burns), chondrocytes (arthritis), brain astrocytes
and microglial cells (Alzheimer's Disease), osteoblasts (osteoporosis), retinal cells (eye diseases) and
pancreatic islet cells (Type I diabetes).

The exogenous sequence is typically inserted in a construct, usually also with a marker gene to aid in
the detection of the knockout construct and/or a selection gene. The knockout construct is inserted in a cell,
typically an embryonic stem (ES) cell, usually by homologous recombination. The resultant transformed cell
can be a single gene knockout (one haplotype) or a double gene (homozygous) knockout. The knockout
construct can be integrated into one or several locations in the cell's genome due to the random nature of
homologous recombination events; however, the recombination does occur between regions of sequence
complementarity. Typically, less than one to five percent of the ES cells that take up the knockout construct will
actually integrate exogenous DNA in these regions of complementarity; thus, identification and selection of cells
with the desired phenotype is usually necessary and a selection or marker sequence is usually incorporated
into the construct for this purpose. Cells which have incorporated the construct are selected for prior to
inserting the genetically manipulated cell into a developing embryo; for example, the cells are subjected to
positive selection (using G418, for example, to select for neomycin-resistance) and negative selection (using,
for example, FIAU to exclude cells lacking thymidine kinase). Selection and marker techniques include
antibiotic resistance selection or β-galactosidase marker expression as described elsewhere in this disclosure.

After selection of manipulated cells with the desired phenotype, such as complete or partial inability to
express endogenous TERT promoter, or expression of the exogenous TERT promoter (as hTERT promoter
activity), the cells are inserted into a mouse embryo. Insertion can be accomplished by a variety of techniques,
such as microinjection, in which about 10 to 30 cells are collected into a micropipet and injected into embryos
that are at the proper stage of development to integrate the ES cell into the developing embryonic blastocyst, at
about the eight cell stage, which for mice is about 3.5 days after fertilization. The embryos are obtained by
perfusing the uterus of pregnant females. After the ES cell has been introduced into the embryo, it is implanted
into the uterus of a pseudopregnant foster mother, which is typically prepared by mating with vasectomized
males of the same species. In mice, the optimal time to implant is about two to three days pseudopregnant.
Offspring are screened for integration of the TERT nucleic acid sequences and the modified promoter activity
phenotype. Offspring that have the desired phenotype are crossed to each other to generate a homozygous
knockout. If it is unclear whether germline cells of the offspring carry the modified promoter, they can be crossed
with a parental or other strain and the offspring screened for heterozygosity of the desired trait. The
heterozygotes can be crossed with each other to produce mice homozygous for modified TERT genomic
sequence. Bijvoet (1998) Hum. Mol. Genet. 7:53-62; Moreadith (1997) J. Mol. Med. 75:208-216; Tojo (1995)
Cytotechnology 19:161-165; Mudgett (1995) Methods Mol. Biol. 48:167-184; Longo (1997) Transgenic Res.
6:321-328; U.S. Patents Nos. 5,616,491 (Mak, et al.); 5,464,764; 5,631,153; 5,487,992; 5,627,059; 5,272,071;
and, WO 91/09955, WO 93/09222, WO 96/29411, WO 95/31560, and WO 91/12650. Thus, the invention
provides for the use of the TERT promoter sequence-containing reagents of the invention to produce
"knockout" mouse cells and animals, transgenic animals, and their progeny. These cells and animals can be
further reconstituted with wild type or modified endogenous mTERT promoter or exogenous TERT promoter,
such as hTERT.
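
The expected outcome of the heterozygote intercross mentioned above can be sketched by simple enumeration. The following Python snippet is an illustration only: it assumes a single locus inherited in Mendelian fashion with no effect on viability, and the allele labels ("T" for the native sequence, "t" for the modified TERT genomic sequence) and the function name `cross` are hypothetical, not taken from this disclosure.

```python
from collections import Counter
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict:
    """Return expected genotype fractions for a single-locus cross.

    Each parent is written as two allele letters, e.g. "Tt".
    """
    # Each offspring receives one allele from each parent; sorting the pair
    # makes "tT" and "Tt" count as the same genotype.
    offspring = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(offspring.values())
    return {genotype: count / total for genotype, count in offspring.items()}


# Hypothetical labels: "T" = native allele, "t" = modified TERT allele.
fractions = cross("Tt", "Tt")
print(fractions)
```

Under these assumptions about one quarter of the offspring of two heterozygotes would be homozygous for the modified sequence, which is why the offspring still need to be screened.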

The present invention further provides methods and reagents for karyotype analysis, gene 
amplification detection, or other chromosomal analysis using probes comprising the TERT promoter sequences 
of the invention. In various embodiments, amplifications (change in copy number), deletions, insertions, 
substitutions, or changes in the chromosomal location (translocations) of TERT promoter-containing genes are
detected. These can be correlated with the presence of a pathological condition or a predisposition to
developing a pathological condition (such as cancer). Thus, this information can be used in a diagnostic or
prognostic manner. For instance, a translocation event could indicate that activation of TERT expression
occurs in some cases by replacing all or part of the TERT promoter with another promoter element that directs
TERT transcription in an inappropriate manner. Furthermore, the methods and reagents of the invention can
be used to inhibit this inappropriate TERT activation.

Determining the chromosomal location of TERT promoter sequence may also be useful for analysis of 
TERT gene repression in normal somatic cells, for instance, whether the location is part of non-expressing 
heterochromatin. Nuclease hypersensitivity assays for distinguishing heterochromatin and euchromatin are 
described in Wu (1979) Cell 16:797; Groudine (1982) Cell 30:131; Gross (1988) Ann. Rev. Biochem. 57:159.
Methods for analyzing karyotype are discussed in Pinkel (1988) Proc. Natl. Acad. Sci. USA 85:9138; EPO Pub.
No. 430,402; Choo, ed., Methods In Molecular Biology Vol. 33: In Situ Hybridization Protocols, Humana Press,
Totowa, New Jersey, 1994; Kallioniemi (1992) Science 258:818.

TERT promoter binding proteins and transcriptional regulatory factors

In addition to the novel TERT promoter sequences and identification of the cis-acting transcriptional
regulatory sequences contained therein, the invention provides for novel in vitro and cell-based in vivo assay
systems to screen for TERT promoter binding proteins (trans-acting transcriptional regulatory factors) using the 
nucleic acids of the invention. Many assays are available that screen for nucleic acid binding proteins and all 
can be adapted and used with the novel TERT sequences provided by the invention. 
One embodiment of the invention provides a method of screening and isolating a TERT promoter
binding compound by contacting a TERT promoter sequence of the invention (particularly, an identified
cis-acting regulatory sequence) with a test compound and measuring the ability of the test compound to bind the
selected nucleic acid. The test compound can be any agent capable of specifically binding to a TERT
promoter, including compounds available in combinatorial libraries, a cell extract, a nuclear extract, a
protein or peptide. If a TERT transcriptional activating protein is the goal of the search, a cell with telomerase
activity is typically chosen.

Various techniques can be used to identify polypeptides which specifically bind to TERT promoter; for
example, mobility shift DNA-binding assays, methylation and uracil interference assays, DNase and hydroxyl
radical footprinting analysis, fluorescence polarization, and UV crosslinking or chemical cross-linkers. For a
general overview, see Ausubel (chapter 12, DNA-Protein Interactions). One technique for isolating
co-associating proteins, including nucleic acid and DNA/RNA binding proteins, includes use of UV crosslinking or
chemical cross-linkers, including cleavable cross-linkers dithiobis (succinimidylpropionate) and 3,3'-dithiobis
(sulfosuccinimidyl-propionate). McLaughlin (1996) Am. J. Hum. Genet. 59:561-569; Tang (1996) Biochemistry
35:8216-8225; Lingner (1996) Proc. Natl. Acad. Sci. USA 93:10712; Chodosh (1986) Mol. Cell. Biol.
6:4723-4733. In many cases, there is a high likelihood that a specific protein (or a related protein) may bind to an
hTERT promoter sequence, such as a Myc, NF-kappa B, E2F, Sp1, AP-1 or CAAT box binding site. In these
scenarios, where an antibody may already be available or one can be easily generated, co-immunoprecipitation
analysis can be used to identify and isolate TERT promoter-binding, trans-acting factors. The trans-acting
factor can be characterized by peptide sequence analysis. Once identified, the function of the protein can be
confirmed, for example, by competition experiments, factor depletion experiments using an antibody specific for
the factor, or by competition with a mutant factor.

Alternatively, TERT promoter-affinity columns can be generated to screen for potential TERT binding
proteins. In a variation of this assay, TERT promoter subsequences are biotinylated, reacted with a solution
suspected of containing a binding protein, and then reacted with a streptavidin affinity column to isolate the
nucleic acid or binding protein complex (Grabowski (1986) Science 233:1294-1299; Chodosh (1986) supra).
The promoter-binding protein can then be conventionally eluted and isolated. The mobility shift DNA-protein binding
assay using nondenaturing polyacrylamide gel electrophoresis (PAGE) is an extremely rapid and sensitive
method for detecting specific polypeptide binding to DNA (Chodosh (1986) supra; Carthew (1985) Cell
43:439-448; Trejo (1997) J. Biol. Chem. 272:27411-27421; Bayliss (1997) Nucleic Acids Res. 25:3984-3990).

Interference assays and DNase and hydroxyl radical footprinting can be used to identify specific 
residues in the nucleic acid protein-binding site. Bi (1997) J. Biol. Chem. 272:26562-26572; Karaoglu (1991)
Nucleic Acids Res. 19:5293-5300. Fluorescence polarization is a powerful technique for characterizing
macromolecular associations and can provide equilibrium determinations of protein-DNA and protein-protein 
interactions. This technique is particularly useful (and better suited than electrophoretic methods) to study low 
affinity protein-protein interactions. Lundblad (1996) Mol. Endocrinol. 10:607-612. 

Proteins identified by these techniques can be further separated on the basis of their size, net surface 
charge, hydrophobicity and affinity for ligands. In addition, antibodies raised against such proteins can be 
conjugated to column matrices and the proteins immunopurified according to well known methods. Scopes, R. 
K., Protein Purification: Principles and Practice, 2nd ed., Springer Verlag, (1987). 

Transcriptional regulatory sequences identified by comparison of hTERT and mTERT sequences
include those for trans-acting factors c-Myc, SP1, SRY, HNF-3β, HNF-5, TFIID-MBP, E2F and c-Myb. Table 1
shows other transcriptional regulatory sequences that have been identified upstream from the TERT encoding
region by comparison of the hTERT sequence with known regulatory motifs. These elements are of interest in
regulating transcription in the cell types where the factors that bind to these elements are present.

TABLE 1: Putative Recognition Elements Upstream from the hTERT Encoding Region

Site Name                   Position (relative to    FLANKING-RECOGNITION SEQUENCE-FLANKING
                            translation start)       (embedded in SEQ. ID NO:1)

AP-2 [illegible] /Rev       [illegible]              GGGCA-[illegible]-ACGAG
HiNF-A                      [illegible]              [illegible]-[illegible]-TATTT
[illegible]                 [illegible]              TCTTG-[illegible]-CCTCC
[illegible]                 [illegible]              GTGAT-[illegible]-ACCTC
[illegible]                 -2717                    GATCC-[illegible]-AGCCT
HiNF-A RS                   [illegible]              GGCCT-[illegible]-TAAAA
EcR-consensus (2)           [illegible]              [illegible]
AP-1 CS3/Rev                -2584                    [illegible]
C/EBP CS1                   -2555                    ATATT-[illegible]-[illegible]
E2A CS                      [illegible]              [illegible]-[illegible]-GGAGG
Yi-consensus                [illegible]              TCCAT-[illegible]-CTACT
C/EBP CS2                   [illegible]              ATCCC-[illegible]-TACTG
[illegible]                 [illegible]              TCTAC-[illegible]-CCCTT
AP-2 [illegible]            [illegible]              TATCC-CCCCCCAGGG-GCAGA
AP-2 CS4                    -2277                    ATCCC-[illegible]-CAGAG
PEA3 RS                     -2241                    [illegible]-[illegible]-GAATG
PEA3 CS                     -2241                    [illegible]-[illegible]-GAATG
Keratinocyte enhancer /Rev  [illegible]              GTTGG-TTTGTTT-GTTTT
HNF-5                       [illegible]              TGGTT-[illegible]-TTTGT
Keratinocyte enhancer /Rev  -2174                    GTTTG-TTTGTTT-TGTTT
Keratinocyte enhancer /Rev  [illegible]              TTTGT-[illegible]-TGAGA
C/EBP CS1 /Rev              [illegible]              CTTGG-[illegible]-GCCTC
INF.1                       [illegible]              GGTTC-AAGTGA-TTCTC
GCN4 CS2                    [illegible]              [illegible]-[illegible]-CTGCT
Sp1-IE-4/5                  [illegible]              AGGCA-[illegible]-ACCAT
AP-2 CS4/Rev                [illegible]              AGACG-GGGGTGGGGG-TGGGG
AP-2 [illegible] /Rev       [illegible]              ATGTT-GGCCAGGC-TGGTC
E2A CS                      [illegible]              GGATT-[illegible]-TGAGC
PEA3 RS                     [illegible]              GAGGT-AGGAAG-CTCAC
PEA3 CS                     [illegible]              GAGGT-AGGAAG-CTCAC
NFI-NFI                     [illegible]              TTTTA-AGCCAAT-GATAG
CTF/NF-1 a                  [illegible]              TTTTA-AGCCAAT-GATAG
CTF/NF-1 b                  [illegible]              TTTTA-AGCCAAT-GATAG
PEA1 RS                     -1730                    TGTGA-TGACTAA-GACAT
AP-1 CS3                    -1730                    TGTGA-TGACTAA-GACAT
AP-1 CS4                    -1730                    TGTGA-TGACTAA-GACAT
PEA3-uPA/Rev                -1630                    AGGCG-TTTCCT-CGCCA
C/EBP CS1 /Rev              -1605                    TGTTA-ATTACTCCA-GCATA
NF-E1 CS1                   -1594                    CCAGC-ATAATCTT-CTGCT
Sp1-IE-3.1                  -1474                    CCAAA-CCGCCC-CTTTG
HNF-5 site                  -1442                    [illegible]
NFkB CS4                    -1404                    [illegible]
SIF-consensus               -1384                    [illegible]
AP-2 CS5                    -1319                    [illegible]
PEA3-uPA /Rev               -1280                    [illegible]
PEA3 CS                     -1256                    [illegible]
HNF-5 CS                    -1215                    [illegible]-[illegible]-CGACC
HSTF CS2                    -1169                    [illegible]
AP-2 CS5                    -970                     [illegible]
Sp1 CS2                     -950                     [illegible]-[illegible]-GATGT
SP1 CS3                     -950                     TGTGC-[illegible]-GATGT
E1A-F CS                    -946                     [illegible]-[illegible]-GACCA
Sp1-IE-3.1                  -807                     [illegible]
AP-1 CS3                    -794                     [illegible]-[illegible]-CAGAT
AP-2 CS5                    -657                     [illegible]-[illegible]-GTCCA
SIF-consensus               -652                     [illegible]
AP-2 CS4                    -620                     [illegible]-[illegible]-CGTCT
GCF-consensus /Rev          -552                     [illegible]-[illegible]-CCGGA
AP-2 CS5                    -531                     [illegible]-[illegible]-TGGGT
Sp1-NPY                     -452                     CATGG-[illegible]-CTCGG
Yi-consensus                -435                     GTTAC-[illegible]-[illegible]
AP-2 CS4/Rev                -358                     GCGGC-[illegible]-GGAAG
Sp1 CS2                     -354                     CGCGC-GGGCGG-GGAAG
SP1 CS3                     [illegible]              CGCGC-GGGCGG-GGAAG
Sp1-IE-3.1                  [illegible]              CGGGT-CCGCCC-GGAGC
E2A CS                      -314                     CCGGA-GCAGCTG-CGCTG
AP-2 CS5 /Rev               [illegible]              GTCGG-GGCCAGGC-CGGGC
AP-2 CS5                    -297                     TCGGG-GCCAGGCC-GGGCT
AP-2 CS5/Rev                [illegible]              AGGCC-GGGCTCCC-AGTGG
c-Myc binding site          -242                     [illegible]-[illegible]-GCGGA
AP-2 CS5/Rev                -217                     [illegible]
SIF-consensus               -212                     [illegible]-[illegible]-CTGCC
Sp1-ras1.1                  -188                     TTCCA-GCTCCGCCTC-CTCCG
[illegible]                 [illegible]              TTCCA-GCTCCGCCTC-CTCCG
Sp1 CS1 /Rev                -168                     CGCGG-ACCCCGCCCC-GTCCC
SP1-IE3/2/Rev               -168                     CGCGG-ACCCCGCCCC-GTCCC
GC-box (1)/Rev              -168                     CGCGG-ACCCCGCCCC-GTCCC
Sp1-junD                    -166                     CGGAC-CCCGCCCC-GTCCC
Sp1-IE-3.1                  -165                     GGACC-CCGCCC-CGTCC
SIF-consensus               -161                     CCCGC-CCCGTC-CCGAC
Sp1-NPY                     -151                     CCCGA-CCCCTCC-CGGGT
Sp1-NPY                     -127                     CCAGC-CCCCTCC-GGGCC
Sp1-NPY                     -108                     CCCAG-CCCCTCC-CCTTC
GCF-consensus /Rev          -88                      TCCGC-GGCCCCGCCC-TCTCC
Yi-consensus                -85                      GCGGC-CCCGCCCTCT-CCTCG
Sp1-IE-3.1                  -84                      CGGCC-CCGCCC-TCTCC
c-Myc binding site          -34                      CTGCG-CACGTG-GGAAG
AP-2 CS5/Rev                -13                      GCCCC-GGCCACCC-CCGCG
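
As an illustration of how entries such as those in Table 1 could be located in a sequence, the sketch below scans an upstream sequence for a recognition site and reports positions relative to the translation start. It is a minimal example under stated assumptions, not the method used to compile the table: the function names are hypothetical; the reported position is taken to be that of the 5'-most base of the recognition sequence; the test fragment is simply the flanking and recognition sequences of the last Table 1 entry (AP-2 CS5/Rev) joined together and assumed to end at position -1; and "/Rev" is assumed to denote a match to the reverse complement of the named consensus.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def upstream_positions(upstream: str, site: str) -> list:
    """Find every start of `site` in `upstream`.

    `upstream` is assumed to end at position -1 (the base just before the ATG),
    so an index i in the string corresponds to position i - len(upstream).
    """
    upstream, site = upstream.upper(), site.upper()
    hits = []
    start = upstream.find(site)
    while start != -1:
        hits.append(start - len(upstream))
        start = upstream.find(site, start + 1)  # allow overlapping matches
    return hits


# Flank + recognition sequence + flank of the AP-2 CS5/Rev entry listed at -13.
fragment = "GCCCC" + "GGCCACCC" + "CCGCG"
print(upstream_positions(fragment, "GGCCACCC"))

# The c-Myc site CACGTG reads the same on both strands.
print(reverse_complement("CACGTG"))
```

To look for reverse-strand ("/Rev") matches under the same assumptions, one would search for `reverse_complement(site)` in the forward sequence.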



The examples and detailed elaboration provided in this disclosure are for illustrative purposes, and are 
not intended to limit the invention. Modifications can be made by those skilled in the art that are included within 
the spirit of this application and scope of the appended claims. 



EXAMPLES 

Example 1: Cloning of λGφ5 and characterization of hTERT genomic sequences
The following example details the cloning of the human hTERT promoter. 

A human genomic DNA library was screened by PCR and hybridization to identify a genomic clone 
containing hTERT RNA coding sequences. The library was a human fibroblast genomic library made using 
DNA from WI38 lung fibroblast cells (Stratagene, Cat # 946204). In this fibroblast library, partial Sau3AI
fragments were ligated into the XhoI site of a commercial phage cloning vector, Lambda FIX® Vector
(Stratagene, San Diego, CA), with insert sizes ranging from approximately 9 kilobases (kb) to 22 kb.

The genomic library was divided into pools of 150,000 phage each. Each pool was screened by nested
PCR, with the outer primer pair TCP1.52 & TCP1.57 and the inner pair TCP1.49 & TCP1.50. These primer pairs span
a putative intron in the genomic DNA of hTERT and ensured the PCR product was derived from a genomic
source and not from contamination by the hTERT cDNA clone. Positive pools were further subdivided until a
pool of 2000 phage was obtained. This pool was plated at low density and screened via hybridization with a
DNA fragment encompassing a subset of hTERT cDNA, generated by restriction digest with SphI and EcoRV.

Two positive clones were isolated and rescreened via nested PCR. At rescreening, both clones were 
positive by PCR. One of the lambda phage clones (designated "Gphi5" or "λGφ5") was digested with NotI,
revealing an insert size of approximately 20 kb. Subsequent mapping indicated the insert size was 15 kb and
that phage λGφ5 contains approximately 13 kb of DNA upstream from the transcriptional start site (upstream
from the cDNA sequence). 

Figure 1 shows the structure of phage λGφ5, mapped by restriction enzyme digestion and DNA
sequencing. 

Isolating, Subcloning and Sequencing the Genomic hTERT Insert 

The phage DNA was digested with NcoI. The fragments were cloned into the plasmid pBBS167. The
resulting subclones were screened by PCR to identify those containing sequences corresponding to the 5'
region of the hTERT cDNA. A subclone (plasmid "pGRN140") containing a 9 kb NcoI fragment (with hTERT
gene sequence and about 4 to 5 kb of lambda vector sequence) was partially sequenced to determine the
orientation of the insert. pGRN140 was digested using SalI to remove lambda vector sequences, the resulting
plasmid (with removed lambda sequences) designated pGRN144. The pGRN144 insert was then sequenced.

A NotI fragment from λGφ5 (containing the complete approximately 15 kbp genomic insert including
the hTERT gene promoter region) was inserted in the NotI site of plasmid pBBS185. Two plasmids were
isolated with their respective inserts oriented in opposite directions. One resulted in the insert oriented with the
hTERT open reading frame (ORF) in the same orientation as the plasmid's Lac promoter, designated
pGRN142; the second, pGRN143.

SEQ. ID NO:1 is a listing of the sequence data obtained from plasmid pGRN142. Nucleotides 1-43
and 15376-15418 are plasmid sequence. Thus, the genomic insert begins at residue 44 and ends at residue
15375. The beginning of the cloned cDNA fragment corresponds to residue 13490. There are Alu sequence
elements located approximately 1700 base pairs upstream. The sequence of the hTERT insert of pGRN142 can now be
obtained from GenBank (http://www.ncbi.nlm.nih.gov/) under Accession PGRN142.INS AF121948.

Numbering of hTERT residues for plasmids in the following examples begins from the translation
initiation codon, according to standard practice in the field. The hTERT ATG codon (the translation initiation
site) begins at residue 13545 of SEQ. ID NO:1. Thus, position -1, the first upstream residue, corresponds to
nucleotide 13544 in SEQ. ID NO:1.

Example 2: TERT Promoter-Driven Reporter Constructs

This example describes the construction of plasmids in which reporter genes are operably linked to 
hTERT promoter sequences of the invention. This also illustrates how the TERT promoter sequence of the
invention can analogously be operatively linked to heterologous sequences, such as polypeptide coding
sequences, for expression in cells and tissues in vitro and in vivo and transgenic animals. As will be evident to
one skilled in the art, techniques such as those illustrated in these examples can be used to test other
candidate sequences for ability to specifically promote transcription in cells expressing TERT. 

hTERT-linked reporter vectors of the invention have numerous uses, including identification of specific 
cis-acting sequences and trans-acting transcriptional regulatory factors. Importantly, these hTERT-containing 
reporter constructs can be used for the screening of agents capable of modulating (i.e., activating or inhibiting)
hTERT transcription. These studies can be conducted in vitro and in vivo. 

A number of reporter genes, such as firefly luciferase, β-glucuronidase, β-galactosidase,
chloramphenicol acetyl transferase, and GFP are known and can be operably linked to hTERT promoter. In
this example, the human secreted alkaline phosphatase (SEAP; ClonTech) was used. The SEAP reporter
gene encodes a truncated form of the placental enzyme which lacks the membrane anchoring domain, thereby
allowing the protein to be secreted efficiently from transfected cells. Levels of SEAP activity detected in the
culture medium have been shown to be directly proportional to changes in intracellular concentrations of SEAP
mRNA and protein. The chemiluminescence-based SEAP assay is about 10-fold more sensitive than similar
assays using firefly luciferase as the reporter enzyme. The SEAP activity can also be assayed with a
fluorescent substrate, which provides sensitivity comparable to luciferase. Berger (1988) Gene 66:1; Cullen
(1992) Meth. Enzymol. 216:362; Yang (1997) Biotechniques 23:1110-1114.

hTERT 5' Upstream and Intron Sequences have "Promoter" Activity

Experiments with reporter constructs comprising various hTERT sequences of the invention identified
cis-acting regions with "promoter" transcriptional activating activity in both 5' upstream and intron sequences.
In brief, four constructs, pGRN148, pGRN150, "pSEAP2 basic" (no promoter sequences = negative control)
and "pSEAP2 control" (contains the SV40 early promoter and enhancer) were constructed and transfected in
triplicate into mortal and immortal cells.

Figure 2 shows the plan for construction of plasmid pGRN148. Briefly, a BglII-Eco47III fragment from
pGRN144 (described above) was digested and cloned into the BglII-NruI site of pSEAP2-Basic (ClonTech, San
Diego, CA). A second reporter-promoter, plasmid pGRN150, was made by inserting the BglII-FspI fragment
from pGRN144 into the BglII-NruI sites of pSEAP2. Plasmid pGRN173 was constructed by using the EcoRV-StuI
fragment from pGRN144. This makes a promoter reporter plasmid that contains the promoter region of
hTERT from approximately 2.5 kb upstream from the start of the hTERT ORF to just after the first intron within
the coding region. The initiating Met was mutated to Leu, so that the second ATG following the promoter region
would be the initiating ATG of the SEAP ORF.

Use of the intron sequence allows identification of regulatory sequences that may be present in the 
intron (the invention provides transcriptional regulatory sequences from any portion of the hTERT genomic 
sequence). In addition to the hTERT derived pSEAP reporter constructs, a positive control vector and a 
negative control vector were used. The negative control (pSEAP2-Basic) is necessary to determine the
background signal associated with the DNA backbone of the vector. A positive control is necessary to confirm
transfection and expression of exogenous DNA and to verify the presence of active SEAP in the culture media. 
The positive control is the pSEAP2-Control vector (ClonTech) which contains the SEAP structural gene under 
transcriptional control of the SV40 promoter and enhancer. 
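
One way the two controls could be used when comparing triplicate readings is sketched below. This is an assumption-laden illustration, not the analysis reported in this example: the function name `relative_activity`, the background-subtraction scheme, and all numerical readings are hypothetical.

```python
from statistics import mean


def relative_activity(sample, negative, positive):
    """Express a construct's SEAP signal as a fraction of the positive control.

    Each argument is a list of replicate readings (e.g. a triplicate).
    The mean negative-control reading is treated as vector background and is
    subtracted from both the sample and the positive control.
    """
    background = mean(negative)
    reference = mean(positive) - background
    if reference <= 0:
        # No signal above background in the positive control: the
        # transfection or the assay itself cannot be trusted.
        raise ValueError("positive control not above background")
    signal = max(mean(sample) - background, 0.0)  # clamp noise below background
    return signal / reference


# Made-up readings in arbitrary units, for illustration only.
print(relative_activity([110, 120, 130], [20, 20, 20], [220, 220, 220]))
```

With a scheme like this, a promoter fragment that is inactive in a given cell line would give a value near zero, while one as active as the SV40 control would give a value near one.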

Three constructs, the control, pGRN148 (which includes hTERT 5' promoter sequences) and
pGRN150, were transfected into a mortal cell line, BJ cells, a human foreskin fibroblast line, Feng (1995)
Science 269:1236; and an immortal cell line, the human embryonic kidney line 293; Graham (1977) J. Gen.
Virol. 36:59. All transfections were done in parallel with the two control plasmids.

In immortal cells, pGRN148 and pGRN150 constructs appear to drive SEAP expression as efficiently 
as the pSEAP2 positive control (containing the SV40 early promoter and enhancer). In contrast, in mortal cells
only the pSEAP2 control gave detectable activity. Similar results were obtained using another normal cell line
(RPE, or retinal pigment epithelial cells; Aronson (1983) In Vitro 19:642-650). In RPE cells transfected with
pGRN150, the hTERT promoter region was inactive while the pSEAP2 control plasmid was active. These 
results indicate that, as expected, hTERT promoter sequences are active in tumor cells but not in mortal cells. 

Identification of the Tissue Specificity Elements of the hTERT Promoter 

The hTERT DNA promoter sequences were cloned into the pSEAP2-Basic transcription reporter 
vector (ClonTech) to generate the plasmids pGRN148, 150, 175, 176, 181, 184, 261, 262, and 319. 
Summarized below are details of the promoter plasmid construction (nucleotide numbers refer to the number of 
nucleotides upstream of the translation initiation site at 13545 of SEQ ID NO:1): 
pEGFP-1: Vector from ClonTech containing the Enhanced Green Fluorescent Protein. 

pGRN140: NCO1 fragment containing hTERT upstream sequences and the first intron of hTERT 
from λGΦ5 into the NCO1 site of a pBBS167 (variant of pUC19 cloning vector with MCS, e.g. 
ATGACCATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCCCATG 
GCAGGCCTCGCGCGCGAGATCTCGGGCCCAATCGATGCCGCGGCGATATCGCTCGAGGAAGCTTGGCA 
CTGGCC (SEQ ID NO:3), and a chloramphenicol sensitive gene between the F1 ori and the Amp gene in the 
opposite orientation from the Amp gene). The fragment is oriented so that the hTERT sequences are in the 
same direction as the Lac promoter. 

pGRN144: described above; SAL1 deletion of pGRN140 to remove phage (lambda) sequences. 

pGRN148: BGL2-ECO47III fragment from pGRN144 containing hTERT upstream sequences (from 
position -51 to -2482) into the BGL2-NRU1 sites of pSEAP2-Basic to make a hTERT promoter/reporter plasmid. 

pGRN150: BGL2-FSP1 fragment from pGRN144 containing 2447 nt of hTERT upstream sequences 
(from position -36 to -2482) into the BGL2-NRU1 sites of pSEAP2 to make a hTERT promoter/reporter plasmid. 

pGRN175: APA1 (Klenow blunt)-SRF1 religation of pGRN150 to delete most of the hTERT upstream 
sequences. This makes a promoter/reporter plasmid that uses 82 nucleotides of hTERT upstream sequences 
(from position -36 to -117). 

pGRN176: PML1-SRF1 religation of pGRN150 to delete most of the hTERT upstream sequences. 
This makes a promoter/reporter plasmid that uses 204 nucleotides of hTERT upstream sequences (from 
position -36 to -239). 

pGRN181: APA1 digestion and religation of pGRN150 to delete all APA1 sites but one. This makes 
a promoter/reporter plasmid that comprises from -36 to -114 and -1076 to -2482 of the hTERT upstream 
sequences. 

pGRN184: XBA1 (partial, Klenow fill)-ECOR1 digest and religation of pGRN150 to make a deletion of 
the hTERT promoter sequences. This makes a promoter/reporter plasmid that expresses a region from -1391 
to -2484 of the hTERT upstream sequences. 

pGRN213: FSP1 fragment containing the CatS gene and the F1 ORI plus part of the AmpR gene into 
the FSP1 sites of pSEAP2-Basic such that the orientation reconstructs the AmpR gene. 

pGRN244: SAL1-NOT1 fragment from pSEAP2-Basic containing the SEAP region into the SAL1-NOT1 
sites of pEGFP-1. This modification adds a selectable marker to the vector. 

pGRN245: SAL1-NOT1 fragment from pGRN176 containing the hTERT-promoter/SEAP region into 
the SAL1-NOT1 sites of pEGFP-1. This modification adds a dominant selectable marker to the vector. 

pGRN246: SAL1-NOT1 fragment from pGRN176 containing the hTERT-promoter/SEAP region into 
the SAL1-NOT1 sites of pEGFP-1. This modification adds a dominant selectable marker to the vector. 

pGRN248: SAL1-NOT1 fragment from pGRN175 containing the hTERT promoter/SEAP region into 
the SAL1-NOT1 sites of pEGFP-1. This modification adds a dominant selectable marker to the vector. 

pGRN259: in vitro mutagenesis using RA94 (CCCGGCCACCCCCGCGAattCGCGCGCTCCCCGCTGC) 
(SEQ ID NO:4) to introduce an EcoRI site at the initiating Met of hTERT in pGRN144. This provides 
hTERT sequences from +1 to -2482 that can be cloned into a vector using EcoRI and BglII. 

pGRN260: in vitro mutagenesis using RA91 
(TTGTACTGAGAGTGCACCATATGCGGTGTGcatgcTACGTAAGAGGTTCCAACTTTCACCATAAT) (SEQ ID NO:5) 
to delete several sites from the chloramphenicol region of pGRN213 to create a variant, more useful, MCS. 
This creates a mutagenesis version of pSEAP2-Basic with more unique cloning sites in its MCS. 

pGRN261: BGL2-ECOR1 fragment from pGRN259 containing hTERT upstream sequences into the 
BGL2-ECOR1 sites of pSEAP2-Basic. This makes a promoter/reporter expression plasmid that contains from 
+1 to -2482 of the hTERT upstream sequences. 

pGRN262: BGL2-ECOR1 fragment from pGRN259 containing hTERT upstream sequences into the 
BGL2-ECOR1 sites of pGRN260. This makes a promoter/reporter expression and mutagenesis plasmid that 
contains from +1 to -2482 of the hTERT upstream sequences. 

pGRN294: BbsI-XhoI fragment from pGRN142 containing hTERT upstream sequences from -1667 to 
-3278 into the BbsI-XhoI sites of pGRN259. This makes a vector containing the genomic upstream region for 
hTERT from +1 to -3278 that can be cloned with EcoRI and XhoI. 

pGRN295: ECOR1-XHO1 fragment from pGRN294 containing from +1 to -3282 of hTERT upstream 
sequences into the ECOR1-XHO1 sites of pGRN260. This makes a SEAP promoter/reporter/mutagenesis 
plasmid. 

pGRN296: ECOR1-XHO1 fragment from pGRN294 containing from +1 to -3282 of the hTERT 
upstream sequences into the ECOR1-XHO1 sites of pSEAP2-Basic. This makes a SEAP promoter/reporter 
plasmid. 

pGRN297: RA96 (AATTGCGAAGCTTACG) (SEQ ID NO:6) and RA97 (AATTCGTAAGCTTCGC) 
(SEQ ID NO:7) annealed to make an oligo linker into the ECOR1 sites of pGRN259 replacing the ECOR1 
fragment of the intron-exon region of pGRN259. 

pGRN299: XHO1-HIND3 fragment from pGRN298 containing from +1 to -3282 of the hTERT 
upstream sequences into the XHO1-HIND3 sites of pGL2-Basic. This makes a Luciferase promoter/reporter 
plasmid with about 3.3 kb of hTERT promoter sequences. 

pGRN300: XHO1-SAC1 fragment from pGRN142 containing hTERT upstream sequences into the 
XHO1-SAC1 sites of pGRN299 such that the resulting construct contains from +1 to -5124 of the hTERT 
upstream sequences. This creates an hTERT promoter/reporter construct using Luciferase as a reporter. 

pGRN310: SAC1 fragment from pGRN142 containing hTERT upstream sequences into the SAC1 
site of pGRN300 such that the resulting construct contains +1 to -7984 of the hTERT upstream sequences. 
This creates an hTERT promoter/reporter construct using Luciferase as a reporter. 

pGRN311: SPE1 fragment from pGRN142 containing from -4773 to -13501 of the hTERT upstream 
sequences into the SPE1 site of pGRN300 such that the orientation reconstructs the genomic region. This 
makes a Luciferase promoter/reporter plasmid that contains the entire pGRN142 upstream genomic region of 
hTERT plus a 365 bp region of genomic DNA from the middle of the 13.5 kb genomic region repeated upstream 
of the T7 promoter. 

pGRN312: BGL2-FSP1 fragment from pGRN144 into the BGL2-HIND3 (Klenow filled) sites of pGL2-Basic. 
This makes a Luciferase promoter/reporter version of pGRN150. 

pGRN313: KPN1-NOT1 digested pGRN311 blunted with T4 polymerase and religated. This makes a 
Luciferase promoter/reporter plasmid using from +1 to -13501 of the hTERT upstream sequences. 

pGRN316: oligo RA101 
(5'-TAGGTACCGAGCTCTTACGCGTGCTAGCCCCACGTGGCGGAGGGACTGGGGACCCGGGCA-3') (SEQ ID NO:8) 
used for in vitro mutagenesis to delete the genomic sequence from pGRN262 between the SRF1 site and the 
first PML1 site. This makes a promoter-reporter plasmid containing hTERT upstream sequences from +1 to -239. 

pGRN317: oligo RA100 
(5'-TAGGTACCGAGCTCTTACGCGTGCTAGCCCCTCGCTGGCGTCCCTGCACCCTGGGAGCGC-3') (SEQ ID NO:9) 
used for in vitro mutagenesis to delete the genomic sequence from pGRN262 between the SRF1 site and next 
to the last APA1 site. This makes a promoter-reporter plasmid containing hTERT upstream sequences from 
+1 to -397. 

pGRN319: RA107 (5'-CGTCCTGCTGCGCACtcaGGAAGCCCTGGCCCC-3') (SEQ ID NO:10) used 
for in vitro mutagenesis to inactivate the 'B' class E-box just proximal to the hTERT initiating Met in pGRN262. 
This changes the CACGTG (SEQ ID NO:11) to CACTCA (SEQ ID NO:12). Also COD1941 
(5'-GATGAATGCTCATGATTCCGTATGGCA-3') (SEQ ID NO:13) was used to switch from CatR to CatS 
introducing a BSPH1 site and COD2866 
(5'-CAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGCGCAAAAACAGGAAGGCAAAATGCC-3') (SEQ ID NO:14) 
was used to select from AmpS to AmpR introducing an FSP1 site. In summary, pGRN319 carries a mutation 
in the E-box. 

pGRN350: RA104 
(5'-TAGGTACCGAGCTCTTACGCGTGCTAGCCCCTCCCAGCCCCTCCCCTTCCTTTCCGCGGC-3') (SEQ ID NO:15) 
used for in vitro mutagenesis to delete the genomic sequence from pGRN262 between the SRF1 site and the 
last APA1 site before the ATG of the hTERT open reading frame (orf). This makes a promoter-reporter plasmid 
containing hTERT upstream sequences from +1 to -117. 

pGRN351: SAC2 fragment from pGRN319 into the SAC2 sites of pGRN350 such that the SEAP orf 
is recreated. This makes a "deactivated E-box" version of pGRN350. 

pGRN352: RA122 (5'-GACCGCGCTTCCCACtcaGCGGAGGGACTGGGG-3') (SEQ ID NO:16) 
used for in vitro mutagenesis to "deactivate" the penultimate class "B" E-box before the translation start site of 
hTERT. 
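
The fragment sizes quoted in the plasmid list can be cross-checked from the stated endpoints. The short Python sketch below is an illustration only: the function name and the dictionary are hypothetical, and it assumes that both endpoints are counted inclusively as nucleotides upstream of the translation initiation site.

```python
def upstream_fragment_length(proximal: int, distal: int) -> int:
    """Return the inclusive length (nt) between two upstream positions.

    Both positions are negative numbers, counted upstream of the
    translation initiation site (assumed convention for this sketch).
    """
    if proximal >= 0 or distal >= 0:
        raise ValueError("positions must be negative (upstream of the start site)")
    return abs(distal - proximal) + 1


# Endpoints as stated in the plasmid descriptions above.
fragments = {
    "pGRN150": (-36, -2482),
    "pGRN175": (-36, -117),
    "pGRN176": (-36, -239),
}

# Compute the length of each promoter fragment.
lengths = {name: upstream_fragment_length(*span) for name, span in fragments.items()}

for name, length in lengths.items():
    print(f"{name}: {length} nt")
```

Under the inclusive-endpoint assumption, the computed values agree with the 2447, 82 and 204 nucleotides stated for pGRN150, pGRN175 and pGRN176, respectively.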

The pSEAP2-Basic plasmid lacks eukaryotic promoter and enhancer sequences. This vector contains 
the SV40 late polyadenylation signal inserted downstream of the SEAP coding sequences to ensure proper and 
efficient processing of the transcript in eukaryotic cells. It also contains a synthetic transcription blocker (TB), 
composed of adjacent polyadenylation and transcription pause sites to reduce background transcription. As 
noted above, the SEAP reporter gene encodes a truncated form of the placental enzyme which lacks the 
membrane anchoring domain, thereby allowing the protein to be efficiently secreted from transfected cells. 

Levels of SEAP activity detected in the culture medium have been shown to be directly proportional to 
changes in intracellular concentrations of SEAP mRNA. The chemiluminescent SEAP substrate CSPD™ 
(ClonTech) was used to detect secreted SEAP. Use of this substrate enables monitoring of the expression of 
the SEAP reporter gene through simple, sensitive, non-radioactive assays of secreted phosphatase activity. 
This chemiluminescent assay can detect as little as 10^-13 g of SEAP protein. The assay is linear over a 
10^4-fold range of enzyme concentrations. This makes the assay (and these vectors) particularly well-suited for 
comparative analyses. 

In addition to the hTERT derived pSEAP reporter constructs, a positive control vector (pSEAP2-Control 
vector) and a negative control vector (pSEAP2-Basic) were used. The promoter constructs (pGRN150, 
175, 176) and the control vectors were transfected into immortal (HEK 293) and mortal (BJ fibroblast, RPE, 
HUVEC) cells. The culture media was collected 48-72 hours after transfection and assayed for SEAP activity. 
The SEAP activity was detected using the chemiluminescent assay from CLONTECH, Great EscAPe™ SEAP 
Chemiluminescence Kit, according to the manufacturer's protocol. The transfections were performed in 
triplicate, and the culture media from each transfection was assayed in triplicate. 
The background values obtained by transfection of the negative control (pSEAP2-Basic) vector were subtracted 
from the values obtained with the test constructs. The average of nine measurements was used and plotted for 
each of the constructs. 
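
The data reduction described above (background subtraction followed by averaging of the nine readings per construct) can be sketched as follows. This is a minimal illustration, not the analysis actually used: the function name and all readings are hypothetical, and it assumes the background is taken as the mean of the pSEAP2-Basic readings.

```python
from statistics import mean


def background_corrected_mean(test_readings, basic_readings):
    """Average the test readings after subtracting the mean background.

    Each argument is a list of three transfections, each holding three
    assay readings (3 x 3 = 9 measurements per construct).
    """
    # Mean signal of the promoterless negative control (assumed background).
    background = mean(v for triplicate in basic_readings for v in triplicate)
    # Subtract the background from every test reading, then average all nine.
    corrected = [v - background for triplicate in test_readings for v in triplicate]
    return mean(corrected)


# Hypothetical luminescence readings (arbitrary units), for illustration only.
test = [[100.0, 110.0, 120.0], [90.0, 100.0, 110.0], [105.0, 95.0, 100.0]]
basic = [[10.0, 10.0, 10.0], [10.0, 10.0, 10.0], [10.0, 10.0, 10.0]]

result = background_corrected_mean(test, basic)
print(f"background-corrected mean: {result:.2f}")
```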

Experimental Results in Immortal and Mortal Cell Lines 

The results of the assays show that while the hTERT promoter constructs are capable of driving the 
expression of the reporter SEAP gene in immortal cells, the same constructs are silent in all mortal cells tested. 
The pSEAP2-Control vector, however, is active in all cell types regardless of their mortal or immortal status, and 
the pSEAP2-Basic vector is silent in all cells assayed. 

hTERT Promoter Driving Thymidine Kinase Expression In vitro 

The invention provides constructs comprising heterologous coding sequences operably linked to 
hTERT promoter sequences. In one embodiment, hTERT promoter sequences are operably linked to Herpes 
simplex virus thymidine kinase ("HSV-TK") coding sequences. HSV-TK is an enzyme that is capable of 
converting innocuous prodrugs, e.g. ganciclovir, into toxic metabolites that interfere with the cellular replication 
of proliferating cells (such as cancer cells, which have hTERT promoter activity). Controlling thymidine 
kinase (TK) expression by subordinating it to the hTERT promoter restricts TK expression to cells where the 
hTERT promoter is normally active. This prevents TK expression in "normal" cells, where the hTERT promoter 
is usually silent. 

The ability of the hTERT promoter to specifically drive the expression of the TK gene in tumor cells 
was tested using a variety of constructs. One construct, designated pGRN266, contains an EcoRI-FseI PCR 
fragment with the TK gene cloned into the EcoRI-FseI sites of pGRN263. pGRN263, containing approximately 
2.5 kb of hTERT promoter sequence, is similar to pGRN150, but contains a neomycin gene as selection 
marker. pGRN267 contains an EcoRI-FseI PCR fragment with the TK gene cloned into the EcoRI-FseI sites of 
pGRN264. pGRN264, containing approximately 210 bp of hTERT promoter sequence, is similar to pGRN176, 
but contains a neomycin gene as selection marker. pGRN268 contains an EcoRI-XbaI PCR fragment with the 
TK gene cloned into the EcoRI-XbaI (unmethylated) sites of pGRN265. pGRN265, containing approximately 90 
bp of hTERT promoter sequence, is similar to pGRN175, but contains a neomycin gene as selection marker. 

These hTERT promoter/TK constructs, pGRN266, pGRN267 and pGRN268, were re-introduced into 
mammalian cells and TK+ stable clones (and/or mass populations) were selected. Ganciclovir treatment in 
vitro of the TK+ cells resulted in selective destruction of all tumor lines tested, including 143B, 293, HT1080, 
BxPC-3, DAOY and NIH3T3. Significantly, ganciclovir treatment had no effect on normal BJ cells. This clearly 
demonstrates the tumor-specificity of all three hTERT promoter fragments used in these experiments. 

Example 3: Direct In vivo hTERT Promoter Suicide Gene Therapy 

The invention provides reagents and methods for treating diseases involving unwanted cell 
proliferation by in vivo gene therapy. To demonstrate the efficacy of this aspect of the invention, the reagents 
of the invention were used to treat cancer (of human origin) in an art-accepted animal model. A human cancer 
cell, the osteosarcoma cell line 143B, which normally expresses the telomerase gene, was transfected with a 
plasmid containing the TK gene driven by the hTERT promoter. 

Specifically, sequences -36 to -2482 upstream of the translation start site of SEQ ID NO:1 were used 
to drive the TK gene. The plasmid also contained the neomycin phosphotransferase gene. After transfection 
of cells with the plasmid, G418 resistant clones expressing TK were selected. Two hundred thousand of the 
parental or TK expressing 143B cells were injected subcutaneously in the flank of Balb/c nude (nu/nu) mice to 
establish tumors. Four to 11 days after tumor implantation the mice were injected IP with 75 mg/kg ganciclovir 
(GCV) or saline twice daily. Tumor growth was monitored every 3-4 days. When GCV was administered either 
at 4 or at 11 days post tumor implantation to these tumor bearing animals, TK mediated cell lysis and retarded 
tumor growth were observed. Such inhibition of tumor cell growth was not observed when saline was administered 
or when the parental 143B tumor (143BP) was treated with either saline or GCV. Forty-five days after tumor 
implantation, only the animals implanted with the TK+ 143B clone and treated with GCV showed 100% survival. 
In the other groups all but one animal died from massive tumor burden. 

These data indicate that the hTERT promoter is sufficient to drive TK gene expression both in vitro and in vivo. 
They also show that the reagents and methods of the invention can be used to promote tumor regression in vivo in 
subjects (including humans) carrying pre-established tumors. 

Example 4: Oncolytic Viruses Under Control of the hTERT Promoter 

As discussed earlier the invention provides "conditionally replicating" oncolytic virus constructs in 
which hTERT promoter sequences of the invention are operably linked to essential virally encoded genes. Use 
of hTERT promoter sequences of the invention ensures the virus will only be productively expressed in cells 
with telomerase activity. Thus, constructs can be used therapeutically to lyse only cells that express 
telomerase, such as immortal or cancer cells. Proliferation of the virus and its cytopathic effects are thus 
restricted to tumor cells. Details of the construction of an exemplary hTERT promoter driven, conditionally 
replicating oncolytic virus follow. In this embodiment, the hTERT promoter replaces the normal E1a promoter 
to create a virus which will only replicate in telomerase expressing cells. 

Plasmid pBR/ITR/549-Clal containing nucleotides 1-356 (Ad2 ITR and packaging signals) and 549- 
920 (a portion of the E1a coding sequence) of Adenovirus 2 (Ad2) linked using a polylinker was built using 
standard molecular biology procedures in the bacterial plasmid pBR322. In pBR/ITR/TB+phTERT176-E1A and 
pBR/ITR/TB+phTERT316-E1A, the normal E1a promoter (Ad2 357-548) has been replaced with the hTERT 
promoter. Ad2 sequences from 916-10680 are added to these plasmids to recreate the expression elements 
of the 5' end of the virus. 

These plasmids (pBR/ITR/TB+phTERT176-10680 and pBR/ITR/TB+phTERT316-10680) are 
transfected into a telomerase expressing human cell line along with an adenoviral DNA fragment containing 
Ad2 sequences 10681-35937. Recombinant plaques are scored and selected 7-21 days post transduction. 
The hTERT promoter E1a containing Ad2 is propagated and produced for use employing standard schemes for 
recombinant Ad2 amplification and manufacturing (Graham and Prevec, 1991, in Methods in Molecular 
Biology, Chapter 11, Ed. E.J. Murray, The Humana Press Inc., Clifton, NJ; Kanegae et al., Jpn J Med Sci Biol, 
1994, 47(3):157-66). Because the E1a gene is driven by the hTERT promoter, which is not normally expressed 
by most somatic cells, recombinant Ad2 genome will only replicate and be packaged into virus particles in cells 
expressing telomerase. 

Example 5: hTERT Promoter Sequences Driving an Alkaline Phosphatase Reporter Gene for High Throughput 
Screening 

The invention provides constructs and promoter-based assays to identify small molecule activators 
and/or repressors of hTERT and telomerase activity. To this end, fragments of the hTERT promoter were 
cloned into plasmids expressing a secreted form of alkaline phosphatase and a selection marker. The SEAP 
constructs (pGRN244, pGRN245, pGRN246 and pGRN248) were re-introduced into normal human cells and 
into immortal cell lines. After selection of stable clones having integrated the hTERT promoter/SEAP 
constructs, RT-PCR was used to determine the levels of SEAP mRNAs. In 293 cells, the levels of SEAP 
mRNA were elevated and comparable to the levels of endogenous hTERT, whereas in BJ cells, the levels of 
SEAP mRNA were virtually undetectable and closely matched the levels of the endogenous hTERT in these 
cells. 

These results indicate that hTERT promoter/SEAP constructs can be used to engineer cells suitable 
for promoter-based assays and to screen for chemical and/or biological activators and/or repressors of 
telomerase in normal and tumor cells. pGRN244, pGRN245, pGRN246 and pGRN248 were re-introduced into 
BJ and 293 cells. SEAP activity and mRNA levels were determined in these cells as criteria for clone selection. 
Several 293 and BJ lines were selected and two BJ/pGRN245 clones were expanded for high throughput 
screening. These constructs were also introduced into IDH4 cells, which are immortal lung fibroblasts that 
express the SV40 large T antigen under the control of the dexamethasone-inducible MMTV promoter. IDH4 
cells are telomerase positive and proliferate in the presence of dexamethasone. However, these cells can be 
induced into a senescent, telomerase negative stage after dexamethasone removal. Upon re-addition of 
dexamethasone, the cells return to an immortal phenotype and re-activate telomerase. 

pGRN244, pGRN245, pGRN246 and pGRN248 were transfected into IDH4 cells. SEAP activity was 
shown to parallel telomerase activity in the different clones, whereas no significant fluctuation of SEAP activity 
was observed with the control plasmid. These results indicate that a fragment of approximately 2.5 kb of 
hTERT promoter sequence (pGRN245) contains sufficient sequence elements to support both activation and 
repression in response to proliferation and/or growth arrest stimuli that control telomerase activity in IDH4 cells. 
Two clones, ID245-1 and ID245-16, whose SEAP profile closely matched telomerase activity during drug 
treatment, were selected and expanded for high throughput screening of small molecule activators of 
telomerase. 

Example 6: hTERT Promoter Sequences Driving a β-galactosidase Reporter Gene to Identify Biological 
Regulators of hTERT and Telomerase Activity 

The invention also provides constructs and promoter-based assays to identify biological modulators of 
hTERT and telomerase activity. An exemplary construct of this aspect of the invention is pGRN353, containing 
a BglII-HindIII fragment from pGRN297 with approximately 2.5 kb of hTERT promoter sequences cloned into 
the BglII-HindIII sites of β-gal-Basic (ClonTech). pGRN353 or similar constructs are re-introduced into BJ cells 
by co-transfection with a plasmid containing a hygromycin gene as selection marker. Clonal cell lines and/or 
mass populations are established and used to screen retroviral based cDNA libraries for genes or fragments of 
genes that can activate the hTERT promoter. pGRN353 or similar constructs are also re-introduced into 143B 
and 293 cells to screen retroviral libraries to identify sequences that can repress the hTERT promoter. 

Example 7: Identifying Trans-Acting Transcriptional Regulatory Elements 

The promoter-reporter (and other) vectors of the invention are also used to identify trans-acting 
transcriptional regulatory elements. As noted supra, plasmids in which reporter genes are operably linked to 
hTERT promoter sequences are extremely useful for identification of trans-acting transcriptional modulatory 
agents and for the screening of potential hTERT promoter-modulating drugs (including biological agents and 
small molecules). Both transient and stable transfection techniques can be used. In one embodiment, stable 
transformants of pGRN148 are made in telomerase negative and telomerase positive cells by cotransfection 
with a eukaryotic selectable marker (such as neo), according to Ausubel, supra. 

The resulting cell lines are used for screening of putative telomerase trans-acting transcriptional 
modulatory agents, for example, by comparing hTERT-promoter-driven expression in the presence and 
absence of the test compound (the putative trans-acting transcriptional modulating agent). Additional 
promoter-reporter vectors (including the constructs described herein, and variations thereof) are similarly used 
to identify and isolate trans-acting factors binding to cis-acting transcriptional regulatory elements, such as Myc, 
Sp1, TATA box binding protein, AP-1, CREB, CAAT binding factor and factors binding to hormone response 
elements (e.g., GRE). The identification and isolation of such trans-acting regulatory factors provide for 
further methods and reagents for modulating the transcription and translation of telomerase. 

Example 8: c-Myc Acts as a Potent Activator of the TERT Promoter by Direct Interaction with Cis-Acting 
Regulatory Sequences 

Use of recombinant constructs comprising TERT promoter sequences of the invention has, for the first 
time, demonstrated that c-Myc acts as a potent activator of telomerase activity by direct interaction with cis- 
acting regulatory sequences in the TERT promoter. Significantly, the studies of the invention also show that 
transcriptional activation of the hTERT promoter by c-Myc can be abrogated by deletion or mutation of a single 
cis-acting regulatory sequence, the "Myc/Max binding site." 

To determine whether experimental induction of c-Myc can lead to the de novo activation of 
telomerase in primary human cells, pre-senescent IMR90 cultures engineered to express the mouse ecotropic 
receptor (Serrano et al. (1997) Cell 88, 593-602) were transduced with either the pBABE retroviral vector or one 
encoding a hormone inducible c-Myc-Estrogen Receptor (cMycER) fusion protein (Eilers et al., 1989 Nature 
340, 66-68; Littlewood (1995) Nuc. Acids Res. 23, 1686-1690). IMR90 cultures do not possess detectable 
telomerase activity or TERT gene expression (Nakamura et al., 1997; Meyerson et al., 1997). 

Retroviral Infection: The mouse ecotropic receptor was transduced into IMR90 fibroblasts and all 
subsequent transductions with ecotropic retrovirus were carried out according to Serrano et al. (1997). pBABE- 
MycER and pBABE vector control viruses were harvested from stable expressing J. cell lines. 

Cell Culture: IMR90 cells were grown in Dulbecco's Modified Eagle Medium (DMEM) (Gibco/BRL) 
supplemented with 10% fetal bovine serum (FBS), 0.29 mg/mL L-glutamine, 0.03% penicillin and streptomycin, 
and 25 µg/mL gentamycin sulfate. For the Myc induction studies in IMR90 cells, MycER transduced cells were 
exposed to 2 µM 4-OHT for 24, 48 and 72 hours. For the promoter studies NIH 3T3 cells were exposed to 1 
µM 4-OHT for 24 and 72 hours. In all cases uninduced controls were treated with an equivalent volume of 
ethanol, the solvent for 4-OHT. 

Telomerase Assays: Telomerase activity was measured by a modified telomerase repeat 
amplification protocol using the TRAPeze™ telomerase detection kit (Oncor, Gaithersburg, MD) (Kim et al., 
1994). Genomic DNA was obtained from vector control or MycER transduced IMR90 fibroblasts. TRAP assays 
were performed on lysates equivalent to 1000 cells for all samples, with 293T cell lysates serving as a positive 
control for telomerase activity. PCR internal controls from each experiment were amplified equally. Inactivation 
of lysate was for 5 minutes at 85°C prior to the TRAP assay. 

In the MycER system, the Myc moiety exists in a latent form bound in a complex with HSP-90 through 
its ER fusion (Eilers et al., 1989; Littlewood et al., 1995). Upon treatment with 4-hydroxy-tamoxifen (4-OHT), 
the MycER protein is liberated from HSP-90, resulting in a Myc over-expression phenotype (Eilers et al., 1989; 
Littlewood et al., 1995). Employing this cell culture system, 4-OHT treatment of MycER-transduced IMR90 
cultures resulted in the marked and sustained activation of telomerase to a level at or above that detected in 
lysates derived from an equivalent number of telomerase-positive 293T tumor cells, as assayed by the sensitive 
TRAP assay. In contrast, untreated MycER-transduced or 4-OHT-treated pBABE-transduced IMR90 cultures 
remained telomerase negative. Western blot analysis confirmed abundant MycER protein levels in the MycER- 
transduced cultures in the presence or absence of 4-OHT. 

Notably, enforced expression of oncogenes such as H-Ras, and cellular modulators of the Rb and p53 
pathways (E7, cyclin D1, Mdm2, dominant-negative p53) have not been found to be capable of influencing 
telomerase activity in IMR90 cells (Wang et al., 1998). 

c-Myc Enhancement of hTERT Transcription Requires the Presence of a Cis-Acting Promoter Element: 
the Proximal Myc-Binding E-Box 

hTERT Reporter Construction: pGRN150 (E-box deleted) and pGRN261 (2.5 kbp hTERT reporter) 
are described above. NIH 3T3 cells were grown in Dulbecco's Modified Eagle Medium (DMEM) (Gibco/BRL) 
supplemented with 10% fetal bovine serum (FBS), 0.29 mg/mL L-glutamine, 0.03% penicillin and streptomycin, 
and 25 µg/mL gentamycin sulfate. NIH 3T3 cells were transfected using LipoFectamine reagent (Life Sciences) 
with 100 ng of a promoter reporter, and 200 ng of pCMX-β-Galactosidase which served as an internal control 
for transfection efficiency. Transfected cells were allowed to recover for 6 hours in complete DMEM and then 
treated with 1 µM 4-OHT or ethanol for 36 hours prior to analysis of secreted alkaline phosphatase activity 
using the Great EscAPe™ assay (ClonTech). β-galactosidase activity was assayed by incubation of whole cell 
extracts with 400 µg/ml ONPG in buffer containing 60 mM Na2HPO4, 40 mM NaH2PO4, 10 mM KCl and 1 mM 
MgSO4, and relative transfection efficiencies were determined by reading absorbance at 415 nm. 
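
One common way to use an internal control of this kind is to divide each reporter signal by the co-transfected β-galactosidase reading and then compare induced and uninduced samples. The Python sketch below illustrates that calculation under stated assumptions: the function names and all numbers are hypothetical, and simple ratio normalization is assumed rather than taken from the text, which does not specify the exact arithmetic.

```python
def normalized_activity(seap_signal: float, bgal_a415: float) -> float:
    """Divide the SEAP signal by the beta-galactosidase absorbance (A415).

    Ratio normalization is an assumption of this sketch; it corrects the
    reporter signal for differences in transfection efficiency.
    """
    if bgal_a415 <= 0:
        raise ValueError("beta-galactosidase absorbance must be positive")
    return seap_signal / bgal_a415


def fold_induction(induced: float, uninduced: float) -> float:
    """Ratio of normalized activity with 4-OHT to that with solvent alone."""
    if uninduced <= 0:
        raise ValueError("uninduced activity must be positive")
    return induced / uninduced


# Hypothetical readings, for illustration only.
induced = normalized_activity(seap_signal=500.0, bgal_a415=0.5)
uninduced = normalized_activity(seap_signal=80.0, bgal_a415=0.8)

print(f"fold induction: {fold_induction(induced, uninduced):.1f}")
```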

Expression of endogenous hTERT following exposure to 4-OHT (or solvent alone) was measured at 
various times in the presence of 1 µM cycloheximide in IMR90 fibroblasts transduced with MycER. Reverse 
transcription of RNA derived from each sample followed by PCR and Southern blotting of the amplified products 
was carried out as described above. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was amplified 
from the same reverse transcription products as an internal semi-quantitative control and visualized by ethidium 
bromide staining. Low level expression of hTERT mRNA was detected in uninduced samples after very long 
exposures; however, the level of hTERT mRNA did not change over time in the uninduced samples. 

The activity of the hTERT promoter was dramatically enhanced by c-Myc-ER in NIH 3T3 cells. The 
ability of c-Myc-ER to enhance hTERT promoter activity was dependent upon sequences in the hTERT 
promoter that included an evolutionarily conserved Myc binding site (E-box). 

To determine whether the increased telomerase activity induced by activation of c-Myc-ER was a 
result of increased transcription of the hTERT gene, we initially examined the effect of 4-OHT induction of 
c-Myc-ER activity upon hTERT promoter sequences placed upstream of the secreted alkaline phosphatase 
reporter gene. The hTERT promoter contains two putative Myc-binding sites positioned at -242 and -34 relative 
to the ATG initiation codon. 

NIH 3T3 cells engineered to express c-Myc-ER stably were transfected with constructs containing a 
secreted alkaline phosphatase reporter under the control of a 2.5 kb fragment of the hTERT promoter, a 2.5 kb 
fragment of the hTERT promoter lacking the proximal E-box, or a promoterless reporter construct. The basal 
activity of the wild-type hTERT promoter and that of the hTERT promoter lacking the proximal E-box were 
equivalent and approximately 3-fold higher than the activity of the promoterless reporter. Induction of c-Myc-ER 
activity with 1 µM 4-OHT enhanced the activity of the 2.5 kb hTERT promoter approximately 10-fold. By 
contrast, the activity of the promoter lacking the proximal E-box was not significantly affected by induction of 
c-Myc-ER. Similarly, the promoterless reporter was not affected by induction of c-Myc-ER. Clearly, this shows 
that transcription of a heterologous encoding region can be regulated by modulating a transcriptional regulatory 
element such as c-Myc within the promoter region, which in turn is modulated by a ligand for the estrogen 
receptor. 
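The fold-induction figures above follow from simple arithmetic on normalized reporter readings. The following Python sketch is illustrative only: the function names and every numeric reading are hypothetical placeholders rather than data from this example, and it assumes that each secreted alkaline phosphatase (SEAP) reading is divided by the β-galactosidase reading from the same transfection before ratios are taken.

```python
def normalize(seap, beta_gal):
    """Correct a SEAP reading for transfection efficiency."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return seap / beta_gal


def fold_induction(induced, uninduced):
    """Ratio of normalized induced activity to normalized uninduced activity.

    Each argument is a (seap, beta_gal) pair from one transfection.
    """
    return normalize(*induced) / normalize(*uninduced)


# Hypothetical readings in arbitrary units, chosen only to show the arithmetic.
readings = {
    "wild-type 2.5 kb promoter": {"uninduced": (30.0, 1.0), "induced": (330.0, 1.1)},
    "proximal E-box deleted": {"uninduced": (30.0, 1.0), "induced": (33.0, 1.1)},
    "promoterless": {"uninduced": (10.0, 1.0), "induced": (11.0, 1.1)},
}

for name, r in readings.items():
    print(f"{name}: {fold_induction(r['induced'], r['uninduced']):.1f}-fold")
```

Normalizing before taking the ratio keeps a difference in transfection efficiency between the induced and uninduced wells from being mistaken for a promoter effect.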

To further confirm the role of the proximal E-box in regulating the hTERT promoter, we tested the effect
of changing the E-box from CACGTG to CACTCA. The E-box mutation reduced 4-OHT-stimulated promoter
activity to the level of the E-box deletion, 10-fold below that of the wild-type promoter. This
demonstrates that c-Myc-ER is not able to significantly activate an hTERT promoter with an attenuated E-box at
-34 and that the E-box at -242 is not able to significantly mediate c-Myc activation. These results suggest that
the ability of c-Myc to stimulate the hTERT promoter is mediated via the -34 E-box.
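The E-box analysis described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the helper names are hypothetical, the toy sequence is invented and is not part of SEQ. ID NO:1, and a site's position is reported as the offset of its first base relative to the ATG, which may differ from the numbering convention used for the -242 and -34 sites.

```python
import re

E_BOX = "CACGTG"   # Myc/Max binding site named in the text
MUTANT = "CACTCA"  # attenuated site used in the mutation experiment


def find_e_boxes(upstream):
    """Return the position of each E-box relative to the ATG.

    `upstream` is the sequence immediately 5' of the ATG, so its last base is -1.
    """
    seq = upstream.upper()
    return [m.start() - len(seq) for m in re.finditer(E_BOX, seq)]


def mutate_e_box(upstream, position):
    """Replace the E-box that starts at `position` (negative) with MUTANT."""
    seq = upstream.upper()
    i = len(seq) + position
    if seq[i:i + len(E_BOX)] != E_BOX:
        raise ValueError(f"no E-box starts at {position}")
    return seq[:i] + MUTANT + seq[i + len(E_BOX):]


# Invented 50-base fragment carrying two E-boxes (hypothetical, for illustration).
toy_upstream = "T" * 8 + E_BOX + "A" * 20 + E_BOX + "G" * 10

print(find_e_boxes(toy_upstream))                     # both sites
print(find_e_boxes(mutate_e_box(toy_upstream, -16)))  # proximal site removed
```

Applied to a real promoter fragment, the same scan would list the candidate sites whose individual contribution can then be tested by mutation, as was done for the proximal E-box.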

hTERT Is a Direct Target of c-Myc Regulated Transcription 

To confirm the ability of c-Myc to stimulate transcription of the hTERT gene directly, we assayed for
hTERT gene expression in MycER-transduced cultures of IMR90 cells 0, 1, 3 and 9 hours following the addition
of 4-OHT. The cultures were treated with cycloheximide for 30 minutes prior to addition of 4-OHT to prevent
de novo protein synthesis. hTERT expression was undetectable at the zero hour time point for the
MycER-transduced cultures. Pretreatment of these cells with cycloheximide alone had no effect on expression of
hTERT mRNA. Induction of c-Myc-ER activity by treatment with 2 µM 4-OHT in the presence of 1
cycloheximide led to a rapid increase in expression of hTERT message.

hTERT expression was detected by 1 hour post-induction, and increased at 3 and 9 hours post-induction.
By contrast, cells treated with solvent alone were not induced to express hTERT. Furthermore, the expression
level of GAPDH was similar at all time points in cells treated with 4-OHT or solvent alone. These observations
strongly suggest that Myc acts directly upon the hTERT promoter to enhance transcription of the hTERT gene.

Lack of Equivalence of Myc and TERT in Cellular Transformation

To further explore the functional implications of Myc induction of telomerase activity in primary cells,
we examined whether TERT could substitute for c-Myc as an immortalizing agent in the rat embryonic fibroblast
(REF) cooperation assay. In this assay, co-transfection of Myc and activated RAS (H-RASG12V) effects the
malignant transformation of early passage REFs. This cooperative activity can be quantified by monitoring the
number of transformed foci appearing in the monolayer 7 to 10 days post-transfection. In two separate
experiments, various combinations of the expression constructs encoding c-Myc, H-RASG12V, TERT, or vector
control were introduced into early passage REFs. Strong cooperative activity was observed in the RAS and
Myc co-transfections, as evidenced by an average of 34 foci per 10 cm plate, while RAS alone generated
between 0 and 3 foci per plate, consistent with previous findings that an immortalizing agent and activated RAS
are required for efficient transformation of primary rodent cells (Land et al., 1983). By contrast, co-transfection
of TERT and RAS did not generate transformed foci counts above that scored for the RAS alone controls.
These results indicate that expression of hTERT is insufficient to account for the immortalizing function of Myc
in a rat embryonic fibroblast (REF) cooperation assay.

Effect of c-Myc-ER on the activity of the hTERT promoter in NIH 3T3 cells was determined by detection
of secreted alkaline phosphatase activity. Cells were treated with 4-OHT for 36 hours. Uninduced cells were
treated with solvent alone for 36 hours. The detected secreted alkaline phosphatase activity was corrected for
transfection efficiency in each case using β-galactosidase.

Example 9: Cloning of mouse TERT promoter

The following example details the cloning of the mouse mTERT promoter. 

mTERT Construction: A hybridization probe (nucleotides 1586-1970) of the mTERT cDNA
(pGRN188) was used to identify a recombinant phage (mTERTI) from a 129SV mouse genomic phage library
(Stratagene). An 8 kb HindIII fragment of mTERTI that hybridized to the 1586-1970 probe was subcloned into
pBluescript™ II KS+ (Stratagene) to generate clone B2.18. The regions encompassing the initiator and
promoter were sequenced.

The mTERT upstream sequence is listed in SEQ. ID NO:2. The sequence can be obtained from
GenBank under Accession B2.18 AF121949.

Figure 3 shows the alignment of homologous portions of the human and mouse promoter sequences.
The sequences were aligned using the GAP program from the Wisconsin GCG package, using a value of 48 for
gap creation and a value of 3 for gap extension. Including a small portion of the coding region (~450 bases) was
found to improve the initial alignment.
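The role of the two gap parameters can be illustrated with a small global-alignment scorer that uses affine gap penalties. This is a sketch of the general technique (the Gotoh form of Needleman-Wunsch alignment), not the GCG implementation. The gap values of 48 and 3 come from the text; the match and mismatch scores, and the convention that a gap of length k costs creation + k × extension, are assumptions made for illustration.

```python
NEG_INF = float("-inf")


def affine_global_score(a, b, match=10, mismatch=0, gap_open=48, gap_extend=3):
    """Best global alignment score under affine gap penalties.

    Assumed convention: a gap of length k costs gap_open + k * gap_extend.
    """
    n, m = len(a), len(b)
    # M: a[i-1] aligned to b[j-1]; X: a[i-1] aligned to a gap; Y: b[j-1] aligned to a gap.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -(gap_open + i * gap_extend)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_open + j * gap_extend)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            # Open a new gap (pay creation plus one extension) or extend the current one.
            X[i][j] = max(
                max(M[i - 1][j], Y[i - 1][j]) - (gap_open + gap_extend),
                X[i - 1][j] - gap_extend,
            )
            Y[i][j] = max(
                max(M[i][j - 1], X[i][j - 1]) - (gap_open + gap_extend),
                Y[i][j - 1] - gap_extend,
            )
    return max(M[n][m], X[n][m], Y[n][m])


print(affine_global_score("CACGTG", "CACGTG"))  # identical sequences, no gaps
print(affine_global_score("AAAA", "AA"))        # one gap of length 2 is unavoidable
```

With a creation penalty much larger than the extension penalty, the scorer prefers one long gap over several short ones, which is consistent with the observation that adding conserved coding sequence helps anchor the initial alignment.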

Conservation of Human and Mouse TERT Promoters 

To determine whether the ability of c-Myc to enhance telomerase activity was mediated through
increased transcription of the hTERT gene, we compared the sequences of the human and mouse TERT
promoters. Alignment of the first 300 bases of the human and mouse promoters indicates a number of
conserved regions. In particular, the Myc/Max binding site (E-box), located at -34 of the human promoter and at
-32 of the mouse promoter, is highly conserved. A second E-box was identified at -242 of the human
promoter; however, this site was not conserved in the mouse promoter. These observations raised the
possibility that the conserved Myc binding site in particular might play a role in the regulation of hTERT
expression by c-Myc.

Example 10: Exemplary oncolytic virus

Based on the principles illustrated in Example 4, the following experiment was done as a model for an
oncolytic virus based on the Ad2 type adenovirus. A construct was made in which the adenovirus E1a
replication gene was placed under control of the hTERT promoter, which should activate transcription in
telomerase-expressing cancer cells. As a positive control, a similar construct was made in which E1a was
placed under control of the CMV promoter, which should activate transcription in any cell.

Reagents were obtained as follows. pBR322, restriction enzymes: NEB, Beverly, MA. Adenovirus
Type 2 (Ad2), tissue culture reagents: Gibco/BRL, Grand Island, NY. Profection Mammalian Transfection
Systems: Promega, Madison, WI. Tumor and Normal Cell lines: ATCC, Manassas, VA, except BJ line, which
was obtained from J. Smith, U. of Texas Southwestern Medical Center.

Briefly, a pBR322-based plasmid was constructed which contains the Adenovirus Type 2 genome with
deletions from 356-548nt (E1a promoter region) and 27971-30937nt (E3). A multiple cloning region was
inserted at the point of deletion of the E1a promoter, and hTERT promoter (-239 to -36nt) or CMV promoter
(-524 to -9nt) was subsequently cloned. Numbering of the CMV sequence is in accordance with Akrigg et al.,
Virus Res 2:107, 1985. Numbering of the Ad2 sequence is in accordance with "DNA Tumor Viruses: Molecular
Biology of Tumor Viruses", J. Tooze ed., Cold Spring Harbor Laboratory, NY.

These plasmid DNAs were digested with SnaBI to liberate ITRs, then phenol-chloroform extracted,
precipitated and transfected into 293A cells for propagation of the virus. Several rounds of plaque purification
were performed using A549 cells, and a final isolate was expanded on these same cells. Viruses were titered
by plaque assay on 293A cells, and tested for the presence of 5' WT Ad sequences by PCR. DNA was isolated
from viruses by Hirt extraction.

The hTERT promoter construct was designated AdphTERT-E1dlE3. The CMV promoter construct
was designated AdCMV-E1dlE3.

Figure 4 shows the effect of these viruses on normal and cancer-derived cell lines. Each cell line was
plated at 5x10 in a 48-well format and infected at an MOI=20, ~24h post plating. The cells were then cultured
over a period of 17-48 days, and fed every fourth day. The pictures shown in the Figure were taken 7 days
after infection. The top row shows the results of cells that were not virally infected (negative control). The
middle row shows the results of cells infected with oncolytic adenovirus, in which replication gene E1a is
operably linked to the hTERT promoter. The bottom row shows the results of cells infected with adenovirus in
which E1a is operably linked to the CMV promoter (positive control). Results are summarized in Table 2:

TABLE 2: Effect of Oncolytic Virus on Cancerous and Non-cancerous Cells

Cell Line  Origin                     Culture Conditions                            Figure      Uninfected  Lysis by         Lysis by
                                                                                                cell Lysis  AdphTERT-E1dlE3  AdCMV-E1dlE3

BJ         foreskin fibroblast        90% DMEM/M199 + 10% FBS                       Fig. 4 (A)  NO          NO               YES
IMR        lung fibroblast            90% DMEM/M199 + 10% FBS                       Fig. 4 (A)  NO          NO               YES
WI-38      lung fibroblast            90% DMEM/M199 + 10% FBS + 5 µg/mL gentamicin  Fig. 4 (A)  NO          NO               YES
A549       lung carcinoma             90% RPMI + 10% FBS                            Fig. 4 (B)  NO          YES              YES
AsPC-1     adenocarcinoma, pancreas   90% RPMI + 10% FBS                            Fig. 4 (B)  NO          YES              YES
BxPC-3     adenocarcinoma, pancreas   90% EMEM + 10% FBS                            Fig. 4 (B)  NO          YES              YES
DAOY       medulloblastoma            90% EMEM + 10% FBS                            Fig. 4 (C)  NO          YES              YES
HeLa       cervical carcinoma         90% EMEM + 10% FBS                            Fig. 4 (C)  NO          YES              YES
HT1080     fibrosarcoma               90% EMEM + 10% FBS                            Fig. 4 (C)  NO          YES              YES


All cell lines tested were efficiently lysed by AdCMV-E1dlE3 by day 17 post-infection. All tumor lines
were lysed by AdphTERT-E1dlE3 in a similar, but slightly delayed time-frame, while normal lines showed no
signs of cytopathic effect and remained healthy out to 6 weeks post-infection.
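The selectivity reported in Table 2 can be restated as a simple check on tabulated data. The Python sketch below transcribes the lysis outcomes from Table 2; the record type, the field names and the tumor/normal flag (inferred here from the Origin column and from the reference to "tumor lines" and "normal lines" above) are illustrative assumptions rather than part of the original data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    cell_line: str
    origin: str
    tumor: bool             # inferred from the Origin column (assumption)
    lysed_uninfected: bool
    lysed_htert: bool       # AdphTERT-E1dlE3
    lysed_cmv: bool         # AdCMV-E1dlE3


TABLE_2 = [
    Row("BJ", "foreskin fibroblast", False, False, False, True),
    Row("IMR", "lung fibroblast", False, False, False, True),
    Row("WI-38", "lung fibroblast", False, False, False, True),
    Row("A549", "lung carcinoma", True, False, True, True),
    Row("AsPC-1", "adenocarcinoma, pancreas", True, False, True, True),
    Row("BxPC-3", "adenocarcinoma, pancreas", True, False, True, True),
    Row("DAOY", "medulloblastoma", True, False, True, True),
    Row("HeLa", "cervical carcinoma", True, False, True, True),
    Row("HT1080", "fibrosarcoma", True, False, True, True),
]


def is_selective(rows):
    """True if the hTERT-driven virus lysed every tumor line and no normal line."""
    return all(r.lysed_htert == r.tumor for r in rows)


print("hTERT-driven virus selective:", is_selective(TABLE_2))
print("CMV-driven virus lysed all lines:", all(r.lysed_cmv for r in TABLE_2))
```

The CMV-driven construct serves as the control in this check: because it lysed every line, the absence of lysis in normal lines infected with the hTERT-driven construct cannot be attributed to a failure of infection.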

In a parallel experiment, each cell line was infected with an adenovirus containing the gene encoding
the green fluorescent protein as a visual marker (MOI=100) to determine the relative transduction efficiency of
these cells by adenovirus vectors. The cell lines exhibited a wide range of transduction efficiencies (~1-2% to
100%). Even cells that are transduced poorly can be efficiently eradicated with the hTERT controlled
adenovirus.

Together, the results confirm that an oncolytic virus can be constructed by placing a genetic element
essential for replication of the virus under control of an hTERT promoter. Replication and lysis occur in cancer
cells, but not in differentiated non-malignant cells.

Figure 5 is a map of the oncolytic adenovirus used in the infection experiment shown in Figure 4. It
comprises the Inverted Terminal Repeat (ITR) from the adenovirus (Ad2); followed by the hTERT
medium-length promoter (phTERT176) operably linked to the adenovirus E1a region; followed by the rest of the
adenovirus deleted for the E3 region (ΔE3). Shown underneath are some modified constructs. The middle
construct comprises an additional sequence in between the hTERT promoter and the E1a region. The HI 
sequence is an artificial intron engineered from adenovirus and immunoglobulin intron splice donor and 
acceptor sequences. It is thought that placing an intron in the hTERT promoter adenovirus replication gene 
cassette will promote processing and transport of heteronuclear RNA, thereby facilitating formation of the 
replicated viral particles. The third adenovirus construct is similar, except that the E1a region used is longer at 
the 5' end by 51 nucleotides. It is thought that this may also promote more efficient conditional replication of 
the oncolytic virus. 

BIOLOGICAL DEPOSIT

The lambda clone designated λGφ5 (from which SEQ. ID NO:1 was determined) was deposited under
terms of the Budapest Treaty with the American Type Culture Collection (ATCC), 10801 University Blvd.,
Manassas, Virginia 20110-2209 U.S.A., on August 14, 1997, under Accession No. 98505.
WO 00/46355 PCT/US00/03104

CLAIMS

What is claimed as the invention is:



1. An oncolytic virus having a genome in which a promoter polynucleotide is operably linked to a genetic 
element essential for replication or assembly of the virus, wherein the promoter polynucleotide 
preferentially promotes transcription of the genetic element in cells expressing telomerase reverse 
transcriptase (TERT), thereby promoting replication of the virus, and wherein replication of the virus in a 
cancer cell leads to lysis of the cancer cell. 

2. The oncolytic virus of claim 1, which is a replication-conditional adenovirus.

3. The oncolytic adenovirus of claim 2, wherein the genetic element essential for replication is an 
adenovirus E1a region. 

4. The oncolytic virus of any of claims 1-3, further comprising an encoding region whose expression is 
toxic to the cell, or which renders the cell more susceptible to toxic effects of a drug. 

5. The oncolytic virus of claim 4, wherein the encoding region encodes thymidine kinase, and the drug is 
ganciclovir. 



6. The oncolytic virus of any preceding claim, wherein the promoter polynucleotide is a promoter for TERT. 

7. The oncolytic virus of any preceding claim, wherein the promoter polynucleotide is a human telomerase 
reverse transcriptase (hTERT) promoter or a mouse telomerase reverse transcriptase (mTERT) 
promoter. 

8. The oncolytic virus of any preceding claim, wherein the promoter polynucleotide comprises a c-Myc 
binding site. 



9. The oncolytic virus of any preceding claim, wherein the promoter polynucleotide has one or more of the 
following features: 

a) it comprises the sequence from position -117 to position -36 relative to the translation initiation
site (position 13545) of SEQ. ID NO:1;

b) it comprises the sequence from position -239 to position -36 relative to the translation initiation
site (position 13545) of SEQ. ID NO:1;

c) it comprises the sequence from position -117 to position +1 relative to the translation initiation
site (position 13545) of SEQ. ID NO:1;

d) it comprises the sequence from position -239 to position +1 relative to the translation initiation
site (position 13545) of SEQ. ID NO:1; or

e) it hybridizes with a polynucleotide complementary to a sequence having feature a), b), c), or d)
under stringent conditions, and has the characteristic of preferentially promoting transcription in cells
expressing TERT.

10. The oncolytic virus of any preceding claim, wherein the promoter has one or more of the following
features:

a) it comprises a sequence of at least about 100 consecutive nucleotides in SEQ. ID NO:1;

b) it comprises a sequence of at least about 500 consecutive nucleotides in SEQ. ID NO:1;

c) it comprises a sequence of at least about 100 consecutive nucleotides in SEQ. ID NO:2;

d) it comprises a sequence of at least about 500 consecutive nucleotides in SEQ. ID NO:2; or 

e) it hybridizes with a polynucleotide complementary to a sequence having feature a), b), c) or d) 
under stringent conditions, and has the characteristic of preferentially promoting transcription in cells 
expressing TERT. 

11. A recombinant polynucleotide in which a promoter is operatively linked to an encoding region, wherein 
the encoding region is preferentially transcribed in cells expressing TERT, and wherein the promoter 
has one or more of the following features: 

a) it comprises the sequence from position -117 to position -36 relative to the translation initiation 
site (position 13545) of SEQ. ID NO:1; 

b) it comprises the sequence from position -239 to position -36 relative to the translation initiation 
site (position 13545) of SEQ. ID NO:1; 

c) it comprises the sequence from position -117 to position +1 relative to the translation initiation 
site (position 13545) of SEQ. ID NO:1; 

d) it comprises the sequence from position -239 to position +1 relative to the translation initiation 
site (position 13545) of SEQ. ID NO:1; or 

e) it hybridizes with a polynucleotide complementary to a sequence having feature a), b), c), or d)
under stringent conditions.

12. A recombinant polynucleotide in which a promoter is operatively linked to an encoding region, wherein 
the encoding region is preferentially transcribed in cells expressing TERT, and wherein the promoter 
consists of no more than 82 consecutive nucleotides. 

13. An isolated polynucleotide having one or more of the following features:

a) it comprises a sequence of at least about 100 consecutive nucleotides in SEQ. ID NO:1;

b) it comprises a sequence of at least about 500 consecutive nucleotides in SEQ. ID NO:1; or

c) it hybridizes with a polynucleotide complementary to a sequence having feature a), b), c) or d) 
under stringent conditions, and has the characteristic of preferentially promoting transcription in cells 
expressing TERT; wherein said polynucleotide is not entirely contained in SEQ. ID NO:6 of PCT
Application WO 98/14593.

14. The isolated polynucleotide of claim 11 further comprising a heterologous encoding region, wherein the
encoding region is preferentially transcribed in cells expressing TERT. 

15. A method for selecting a virus having characteristics of an oncolytic virus according to any of claims 1- 
10, comprising providing a recombinant virus in which a promoter polynucleotide is operably linked to a 
genetic element required for replication of the virus, using the virus to infect a cell expressing TERT and 
a cell not expressing TERT, and selecting the virus if it preferentially kills the cell expressing TERT. 

16. A method of regulating transcription of an encoding region operatively linked to a promoter, wherein the
promoter preferentially promotes transcription of the encoding region in cells expressing TERT, and the 
method comprises modulating a transcriptional regulatory element within the promoter. 

17. The method of regulating transcription according to claim 16, wherein the transcriptional regulatory 
element is modulated by a factor selected from the group consisting of c-Myc, SP1, SRY, HNF-3B, 
HNF-5, TFIID-MBP, E2F and c-Myb. 

18. The method of regulating transcription according to claim 17, wherein the factor is c-Myc.

19. The method of regulating transcription according to claim 18, wherein c-Myc is modulated by contacting 
the cell with a ligand for the estrogen receptor. 

20. The method of regulating transcription according to any of claims 16-19, wherein the encoding region 
encodes TERT. 

21. The method of regulating transcription according to any of claims 16-19, wherein the encoding region is 
heterologous to the promoter. 

22. A method for expressing a polynucleotide in a cell, comprising transducing the cell with a vector in which 
the polynucleotide is operably linked to an hTERT promoter comprising an E box, and then treating the 
cell to increase binding of a transcriptional regulatory factor to the E box. 

23. A method of treating a subject for a disease associated with increased expression of TERT in affected 
cells, comprising administering to the subject an effective amount of the oncolytic virus according to any 
of claims 1-10. 

24. The method of claim 23, wherein the disease is a cancer. 

25. Use of the oncolytic virus according to any of claims 1-10 in the preparation of a medicament for 
treatment of a human or animal body. 

26. Use of the oncolytic virus according to any of claims 1-10 in the preparation of a medicament for 
treatment of cancer. 
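The promoter fragments recited in claims 9 and 11 are defined by positions relative to the translation initiation site at position 13545 of SEQ. ID NO:1. The following Python sketch converts those relative positions into coordinates within the listed sequence. It is an illustration under stated assumptions: the helper names are hypothetical, and it assumes that +1 is the first base of the initiation codon at position 13545 and that there is no position 0.

```python
ATG_POSITION = 13545  # translation initiation site in SEQ. ID NO:1, per the claims


def to_absolute(relative):
    """Map a position relative to the initiation site onto 1-based sequence numbering.

    Assumes +1 is the base at ATG_POSITION and that position 0 does not exist.
    """
    if relative == 0:
        raise ValueError("position 0 is not defined in this convention")
    if relative > 0:
        return ATG_POSITION + relative - 1
    return ATG_POSITION + relative


def fragment(start_rel, end_rel):
    """Return (start, end, length) in 1-based inclusive coordinates."""
    start, end = to_absolute(start_rel), to_absolute(end_rel)
    return start, end, end - start + 1


for start_rel, end_rel in [(-117, -36), (-239, -36), (-117, 1), (-239, 1)]:
    start, end, length = fragment(start_rel, end_rel)
    print(f"{start_rel:+d} to {end_rel:+d}: {start}-{end} ({length} nt)")
```

Under this convention the fragment from -117 to -36 spans 82 nucleotides, which agrees with the 82-nucleotide limit recited in claim 12; that length does not depend on how position 0 is treated, because both endpoints lie upstream of the initiation site.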

SEQUENCE LISTING

SEQ. ID NO:1 (hTERT gene sequence in GenBank Accession AF121948)



GCGGCCGCGA GCTCTAATAC GACTCACTAT AGGGCGTCGA CTCGATCAAT GGAAGATGAG 60
GCATTGCCGA AGAAAAGATT AATGGATTTG AACACACAGC AACAGAAACT ACATGAAGTG 120
AAACACAGGA AAAAAAAGAT AAAGAAACGA AAAGAAAAGG GCATCAGTGA GCTTCAGCAG 180
AAGTTCCATC GGCCTTACAT ATGTGTAAGC AGAGGCCCTG TAGGAGCAGA GGCAGGGGGA 240
AAATACTTTA AGAAATAATG TCTAAAAGTT TTTCAAATAT GAGGAAAAAC ATAAAACCAC 300
AGATCCAAGA AGCTCAACAA AACAAAGCAC AAGAAACAGG AAGAAATTAA AAGTTATATC 360
ACAGTCAAAT TGCTGAAAAC CAGCAACAAA GAGAATATCT TAAGAGTATC AGAGGAAAAG 420
AGATTAATGA CAGGCCAAGA AACAATGAAA ACAATACAGA TTTCTTGTAG GAAACACAAG 480
ACAAAAGACA TTTTTTAAAA CCAAAAGGAA AAAAAATGCT ACATTAAAAT GTTTTTTACC 540
CACTGAAAGT ATATTTCAAA ACATATTTTA GGCCAGGCTT GGTGGCTCAC ACCTGTAATC 600
CCAGCACTTT GGGAGGCCAA GGTGGGTGGA TCGCTTAAGG TCAGGAGTTC GAGACCAGCC 660
TGGCCAATAT AGCGAAACCC CATCTGTACT AAAAACACAA AAATTAGCTG GGTGTGGTGA 720
CACATGCCTG TAATCCCAGG TACTCAGGAG GCTAAGGCAG GAGAATTGCT TGAACTGGGA 780
GGCAGAGGTG GTGAGCCAAG ATTGCACCAG TGCACTCCAG CCTTGGTGAC AGAGTGAAAC 840
TCCATCTCAA AAACAAACAA ACAAAATACA TATACATAAA TATATATGCA CATATATATA 900
CATATATAAA TATATATACA CATATATAAA TCTATATACA TATATACATA TATACACATA 960
TATAAATCTA TATACATATA TATACATATA TAATATATTT ACATATATAA ATATATACAT 1020
ATATAAATAT ACATATATAA ATACATATAT AAATATACAT ATATAAATAT ACATATATAA 1080
ATATACATAT ATAAATATAT ACATATATAA ATATACATAT ATAAATATAT ATACATATAT 1140
AAATATATAA ATATACAAGT ATATACAAAT ATATACATAT ATAAATGTAT ATACGTATAT 1200
ACATATATAT ATAAATATAT AAAAAAACTT TTGGCTGGGC ACCTTTCCAA ATCTCATGGC 1260
ACATATAAGT CTCATGGTAA CCTCAAATAA AAAAACATAT AACAGATACA CCAAAAATAA 1320
AAACCAATAA ATTAAATCAT GCCACCAGAA GAAATTACCT TCACTAAAAG GAACACAGGA 1380
AGGAAAGAAA GAAGGAAGAG AAGACCATGA AACAACCAGA AAACAAACAA CAAAACAGCA 1440
GGAGTAATTC CTGACTTATC AATAATAATG CTGGGTGTAA ATGGACTAAA CTCTCCAATC 1500
AAAAGACATA GAGTGGCTGA ATGGACGAAA AAAACAAGAC TCAATAATCT GTTGCCTACA 1560
AGAATATACT TCACCTATAA AGGGACACAT AGACTGAAAA TAAAAGGAAG GAAAAATATT 1620
CTATGCAAAT GGAAACCAAA AAAAGAACAG AACTAGCTAC ACTTATATCA GACAAAATAG 1680
ATTTCAAGAC AAAAAGTACA AAAAGAGACA AAGTAATTAT ATAATAATAA AGCAAAAAGA 1740
TATAACAATT GTGAATTTAT ATGCGCCCAA CACTGGGACA CCCAGATATA TACAGCAAAT 1800
ATTATTAGAA CTAAGGAGAG AGAGAGATCC CCATACAATA ATAGCTGGAG ACTTCACCCC 1860
GCTTTTAGCA TTGGACAGAT CATCCAGACA GAAAATCAAC CAAAAAATTG GACTTAATCT 1920
ATAATATAGA ACAAATGTAC CTAATTGATG TTTACAAGAC ATTTCATCCA GTAGTTGCAG 1980
AATATGCATT TTTTCCTCAG CATATGGATC ATTCTCAAGG ATAGACCATA TATTAGGCCA 2040
CAGAACAAGC CATTAAAAAT TCAAAAAAAT TGAGCCAGGC ATGATGGCTT ATGCTTGTAA 2100
TTACAGCACT TTGGGGAGGG TGAGGTGGGA GGATGTCTTG AGTACAGGAG TTTGAGACCA 2160
GCCTGGGCAA AATAGTGAGA CCCTGTCTCT ACAAACTTTT TTTTTTAATT AGCCAGGCAT 2220
AGTGGTGTGT GCCTGTAGTC CCAGCTACTT AGGAGGCTGA AGTGGGAGGA TCACTTGAGC 2280
CCAAGAGTTC AAGGCTACGG TGAGCCATGA TTGCAACACC ACACACCAGC CTTGGTGACA 2340
GAATGAGACC CTGTCTCAAA AAAAAAAAAA AAAATTGAAA TAATATAAAG CATCTTCTCT 2400
GGCCACAGTG GAACAAAACC AGAAATCAAC AACAAGAGGA ATTTTGAAAA CTATACAAAC 2460
ACATGAAAAT TAAACAATAT ACTTCTGAAT AACCAGTGAG TCAATGAAGA AATTAAAAAG 2520
GAAATTGAAA AATTTATTTA AGCAAATGAT AACGGAAACA TAACCTCTCA AAACCCACGG 2580
TATACAGCAA AAGCAGTGCT AAGAAGGAAG TTTATAGCTA TAAGCAGCTA CATCAAAAAA 2640
GTAGAAAAGC CAGGCGCAGT GGCTCATGCC TGTAATCCCA GCACTTTGGG AGGCCAAGGC 2700
GGGCAGATCG CCTGAGGTCA GGAGTTCGAG ACCAGCCTGA CCAACACAGA GAAACCTTGT 2760
CGCTACTAAA AATACAAAAT TAGCTGGGCA TGGTGGCACA TGCCTGTAAT CCCAGCTACT 2820
CGGGAGGCTG AGGCAGGATA ACCGCTTGAA CCCAGGAGGT GGAGGTTGCG GTGAGCCGGG 2880
ATTGCGCCAT TGGACTCCAG CCTGGGTAAC AAGAGTGAAA CCCTGTCTCA AGAAAAAAAA 2940
AAAAGTAGAA AAACTTAAAA ATACAACCTA ATGATGCACC TTAAAGAACT AGAAAAGCAA 3000
GAGCAAACTA AACCTAAAAT TGGTAAAAGA AAAGAAATAA TAAAGATCAG AGCAGAAATA 3060
AATGAAACTG AAAGATAACA ATACAAAAGA TCAACAAAAT TAAAAGTTGG TTTTTTGAAA 3120
AGATAAACAA AATTGACAAA CCTTTGCCCA GACTAAGAAA AAAGGAAAGA AGACCTAAAT 3180
AAATAAAGTC AGAGATGAAA AAAGAGACAT TACAACTGAT ACCACAGAAA TTCAAAGGAT 3240
CACTAGAGGC TACTATGAGC AACTGTACAC TAATAAATTG AAAAACCTAG AAAAAATAGA 3300
TAAATTCCTA GATGCATACA ACCTACCAAG ATTGAACCAT GAAGAAATCC AAAGCCCAAA 3360
CAGACCAATA ACAATAATGG GATTAAAGCC ATAATAAAAA GTCTCCTAGC AAAGAGAAGC 3420
CCAGGACCCA ATGGCTTCCC TGCTGGATTT TACCAATCAT TTAAAGAAGA ATGAATTCCA 3480
ATCCTACTCA AACTATTCTG AAAAATAGAG GAAAGAATAC TTCCAAACTC ATTCTACATG 3540
GCCAGTATTA CCCTGATTCC AAAACCAGAC AAAAACACAT CAAAAACAAA CAAACAAAAA 3600
AACAGAAAGA AAGAAAACTA CAGGCCAATA TCCCTGATGA ATACTGATAC AAAAATCCTC 3660
AACAAAACAC TAGCAAACCA AATTAAACAA CACCTTCGAA AGATCATTCA TTGTGATCAA 3720
GTGGGATTTA TTCCAGGGAT GGAAGGATGG TTCAACATAT GCAAATCAAT CAATGTGATA 3780
CATCATCCCA ACAAAATGAA GTACAAAAAC TATATGATTA TTTCACTTTA TGCAGAAAAA 3840
GCATTTGATA AAATTCTGCA CCCTTCATGA TAAAAACCCT CAAAAAACCA GGTATACAAG 3900
AAACATACAG GCCAGGCACA GTGGCTCACA CCTGCGATCC CAGCACTCTG GGAGGCCAAG 3960
GTGGGATGAT TGCTTGGGCC CAGGAGTTTG AGACTAGCCT GGGCAACAAA ATGAGACCTG 4020
GTCTACAAAA AACTTTTTTA AAAAATTAGC CAGGCATGAT GGCATATGCC TGTAGTCCCA 4080
GCTAGTCTGG AGGCTGAGGT GGGAGAATCA CTTAAGCCTA GGAGGTCGAG GCTGCAGTGA 4140
GCCATGAACA TGTCACTGTA CTCCAGCCTA GACAACAGAA CAAGACCCCA CTGAATAAGA 4200
AGAAGGAGAA GGAGAAGGGA GAAAGGAGGG AGAAGGGAGG AGGAGGAGAA GGAGGAGGTG 4260

GAGGAGAAGT GGAAGGGGAA GGGGAAGGGA AAGAGGAAGA AGAAGAAACA TATTTCAACA 4320 

TAATAAAAGC CCTATATGAC AGACCGAGGT AGTATTATGA GGAAAAACTG AAAGCCTTTC 4380 

CTCTAAGATC TGGAAAATGA CAAGGGCCCA CTTTCACCAC TGTGATTCAA CATAGTACTA 4440 

GAAGTCCTAG CTAGAGCAAT CAGATAAGAG AAAGAAATAA AAGGCATCCA AACTGGAAAG 4500 

GAAGAAGTCA AATTATCCTG TTTGCAGATG ATATGATCTT ATATCTGGAA AAGACTTAAG 4560

ACACCACTAA AAAACTATTA GAGCTGAAAT TTGGTACAGC AGGATACAAA ATCAATGTAC 4620

AAAAATCAGT AGTATTTCTA TATTCCAACA GCAAACAATC TGAAAAAGAA ACCAAAAAAG 4680

CAGCTACAAA TAAAATTAAA CAGCTAGGAA TTAACCAAAG AAGTGAAAGA TCTCTACAAT 4740

GAAAACTATA AAATATTGAT AAAAGAAATT GAAGAGGGCA CAAAAAAAGA AAAGATATTC 4800 

CATGTTCATA GATTGGAAGA ATAAATACTG TTAAAATGTC CATACTACCC AAAGCAATTT 4860

ACAAATTCAA TGCAATCCCT ATTAAAATAC TAATGACGTT CTTCACAGAA ATAGAAGAAA 4920 

CAATTCTAAG ATTTGTACAG AACCACAAAA GACCCAGAAT AGCCAAAGCT ATCCTGACCA 4980 

AAAAGAACAA AACTGGAAGC ATCACATTAC CTGACTTCAA ATTATACTAC AAAGCTATAG 5040 

TAACCCAAAC TACATGGTAC TGGCATAAAA ACAGATGAGA CATGGACCAG AGGAACAGAA 5100 

TAGAGAATCC AGAAACAAAT CCATGCATCT ACAGTGAACT CATTTTTGAC AAAGGTGCCA 5160 

AGAACATACT TTGGGGAAAA GATAATCTCT TCAATAAATG GTGCTGGAGG AACTGGATAT 5220 

CCATATGCAA AATAACAATA CTAGAACTCT GTCTCTCACC ATATACAAAA GCAAATCAAA 5280 

ATGGATGAAA GGCTTAAATC TAAAACCTCA AACTTTGCAA CTACTAAAAG AAAACACCGG 5340 

AGAAACTCTC CAGGACATTG GAGTGGGCAA AGACTTCTTG AGTAATTCCC TGCAGGCACA 5400 

GGCAACCAAA GCAAAAACAG ACAAATGGGA TCATATCAAG TTAAAAAGCT TCTGCCCAGC 5460 

AAAGGAAACA ATCAACAAAG AGAAGAGACA ACCCACAGAA TGGGAGAATA TATTTGCAAA 5520 

CTATTCATCT AACAAGGAAT TAATAACCAG TATATATAAG GAGCTCAAAC TACTCTATAA 5580 

GAAAAACACC TAATAAGCTG ATTTTCAAAA ATAAGCAAAA GATCTGGGTA GACATTTCTC 5640 

AAAATAAGTC ATACAAATGG CAAACAGGCA TCTGAAAATG TGCTCAACAC CACTGATCAT 5700 

CAGAGAAATG CAAATCAAAA CTACTATGAG AGATCATCTC ACCCCAGTTA AAATGGCTTT 5760 

TATTCAAAAG ACAGGCAATA ACAAATGCCA GTGAGGATGT GGATAAAAGG AAACCCTTGG 5820 

ACACTGTTGG TGGGAATGGA AATTGCTACC ACTATGGAGA ACAGTTTGAA AGTTCCTCAA 5880 

AAAACTAAAA ATAAAGCTAC CATACAGCAA TCCCATTGCT AGGTATATAC TCCAAAAAAG 5940 

GGAATCAGTG TATCAACAAG CTATCTCCAC TCCCACATTT ACTGCAGCAC TGTTCATAGC 6000

AGCCAAGGTT TGGAAGCAAC CTCAGTGTCC ATCAACAGAC GAATGGAAAA AGAAAATGTG 6060 

GTGCACATAC ACAATGGAGT ACTACGCAGC CATAAAAAAG AATGAGATCC TGTCAGTTGC 6120 

AACAGCATGG GGGGCACTGG TCAGTATGTT AAGTGAAATA AGCCAGGCAC AGAAAGACAA 6180 

ACTTTTCATG TTCTCCCTTA CTTGTGGGAG CAAAAATTAA AACAATTGAC ATAGAAATAG 6240 

AGGAGAATGG TGGTTCTAGA GGGGTGGGGG ACAGGGTGAC TAGAGTCAAC AATAATTTAT 6300 

TGTATGTTTT AAAATAACTA AAAGAGTATA ATTGGGTTGT TTGTAACACA AAGAAAGGAT 6360

AAATGCTTGA AGGTGACAGA TACCCCATTT ACCCTGATGT GATTATTACA CATTGTATGC 6420 

CTGTATCAAA ATATCTCATG TATGCTATAG ATATAAACCC TACTATATTA AAAATTAAAA 6480 

TTTTAATGGC CAGGCACGGT GGCTCATGTC CATAATCCCA GCACTTTGGG AGGCCGAGGC 6540

GGTGGATCAC CTGAGGTCAG GAGTTTGAAA CCAGTCTGGC CACCATGATG AAACCCTGTC 6600 

TCTACTAAAG ATACAAAAAT TAGCCAGGCG TGGTGGCACA TACCTGTAGT CCCAACTACT 6660 

CAGGAGGCTG AGACAGGAGA ATTGCTTGAA CCTGGGAGGC GGAGGTTGCA GTGAGCCGAG 6720 

ATCATGCCAC TGCACTGCAG CCTGGGTGAC AGAGCAAGAC TCCATCTCAA AACAAAAACA 6780 

AAAAAAAGAA GATTAAAATT GTAATTTTTA TGTACCGTAT AAATATATAC TCTACTATAT 6840

TAGAAGTTAA AAATTAAAAC AATTATAAAA GGTAATTAAC CACTTAATCT AAAATAAGAA 6900 

CAATGTATGT GGGGTTTCTA GCTTCTGAAG AAGTAAAAGT TATGGCCACG ATGGCAGAAA 6960 

TGTGAGGAGG GAACAGTGGA AGTTACTGTT GTTAGACGCT CATACTCTCT GTAAGTGACT 7020 

TAATTTTAAC CAAAGACAGG CTGGGAGAAG TTAAAGAGGC ATTCTATAAG CCCTAAAACA 7080 

ACTGCTAATA ATGGTGAAAG GTAATCTCTA TTAATTACCA ATAATTACAG ATATCTCTAA 7140 

AATCGAGCTG CAGAATTGGC ACGTCTGATC ACACCGTCCT CTCATTCACG GTGCTTTTTT 7200 

TCTTGTGTGC TTGGAGATTT TCGATTGTGT GTTCGTGTTT GGTTAAACTT AATCTGTATG 7260 

AATCCTGAAA CGAAAAATGG TGGTGATTTC CTCCAGAAGA ATTAGAGTAC CTGGCAGGAA 7320 

GCAGGTGGCT CTGTGGACCT GAGCCACTTC AATCTTCAAG GGTCTCTGGC CAAGACCCAG 7380

GTGCAAGGCA GAGGCCTGAT GACCCGAGGA CAGGAAAGCT CGGATGGGAA GGGGCGATGA 7440 

GAAGCCTGCC TCGTTGGTGA GCAGCGCATG AAGTGCCCTT ATTTACGCTT TGCAAAGATT 7500 

GCTCTGGATA CCATCTGGAA AAGGCGGCCA GCGGGAATGC AAGGAGTCAG AAGCCTCCTG 7560 

CTCAAACCCA GGCCAGCAGC TATGGCGCCC ACCCGGGCGT GTGCCAGAGG GAGAGGAGTC 7620

AAGGCACCTC GAAGTATGGC TTAAATCTTT TTTTCACCTG AAGCAGTGAC CAAGGTGTAT 7680 

TCTGAGGGAA GCTTGAGTTA GGTGCCTTCT TTAAAACAGA AAGTCATGGA AGCACCCTTC 7740 

TCAAGGGAAA ACCAGACGCC CGCTCTGCGG TCATTTACCT CTTTCCTCTC TCCCTCTCTT 7800 

GCCCTCGCGG TTTCTGATCG GGACAGAGTG ACCCCCGTGG AGCTTCTCCG AGCCCGTGCT 7860 

GAGGACCCTC TTGCAAAGGG CTCCACAGAC CCCCGCCCTG GAGAGAGGAG TCTGAGCCTG 7920 

GCTTAATAAC AAACTGGGAT GTGGCTGGGG GCGGACAGCG ACGGCGGGAT TCAAAGACTT 7980 

AATTCCATGA GTAAATTCAA CCTTTCCACA TCCGAATGGA TTTGGATTTT ATCTTAATAT 8040 

TTTCTTAAAT TTCATCAAAT AACATTCAGG AGTGCAGAAA TCCAAAGGCG TAAAACAGGA 8100 

ACTGAGCTAT GTTTGCCAAG GTCCAAGGAC TTAATAACCA TGTTCAGAGG GATTTTTCGC 8160 

CCTAAGTACT TTTTATTGGT TTTCATAAGG TGGCTTAGGG TGCAAGGGAA AGTACACGAG 8220 

GAGAGGACTG GGCGGCAGGG CTATGAGCAC GGCAAGGCCA CCGGGGAGAG AGTCCCCGGC 8280

CTGGGAGGCT GACAGCAGGA CCACTGACCG TCCTCCCTGG GAGCTGCCAC ATTGGGCAAC 8340

GCGAAGGCGG CCACGCTGCG TGTGACTCAG GACCCCATAC CGGCTTCCTG GGCCCACCCA 8400 

CACTAACCCA GGAAGTCACG GAGCTCTGAA CCCGTGGAAA CGAACATGAC CCTTGCCTGC 8460

CTGCTTCCCT GGGTGGGTCA AGGGTAATGA AGTGGTGTGC AGGAAATGGC CATGTAAATT 8520 

ACACGACTCT GCTGATGGGG ACCGTTCCTT CCATCATTAT TCATCTTCAC CCCCAAGGAC 8580 

TGAATGATTC CAGCAACTTC TTCGGGTGTG ACAAGCCATG ACAACACTCA GTACAAACAC 8640 

CACTCTTTTA CTAGGCCCAC AGAGCACGGC CCACACCCCT GATATATTAA GAGTCCAGGA 8700 

GAGATGAGGC TGCTTTCAGC CACCAGGCTG GGGTGACAAC AGCGGCTGAA CAGTCTGTTC 8760

CTCTAGACTA GTAGACCCTG GCAGGCACTC CCCCAGATTC TAGGGCCTGG TTGCTGCTTC 8820 

CCGAGGGCGC CATCTGCCCT GGAGACTCAG CCTGGGGTGC CACACTGAGG CCAGCCCTGT 8880

CTCCACACCC TCCGCCTCCA GGCCTCAGCT TCTCCAGCAG CTTCCTAAAC CCTGGGTGGG 8940

CCGTGTTCCA GCGCTACTGT CTCACCTGTC CCACTGTGTC TTGTCTCAGC GACGTAGCTC 9000

GCACGGTTCC TCCTCACATG GGGTGTCTGT CTCCTTCCCC AACACTCACA TGCGTTGAAG 9060

GGAGGAGATT CTGCGCCTCC CAGACTGGCT CCTCTGAGCC TGAACCTGGC TCGTGGCCCC 9120

CGATGCAGGT TCCTGGCGTC CGGCTGCACG CTGACCTCCA TTTCCAGGCG CTCCCCGTCT 9180

CCTGTCATCT GCCGGGGCCT GCCGGTGTGT TCTTCTGTTT CTGTGCTCCT TTCCACGTCC 9240

AGCTGCGTGT GTCTCTGTCC GCTAGGGTCT CGGGGTTTTT ATAGGCATAG GACGGGGGCG 9300

TGGTGGGCCA GGGCGCTCTT GGGAAATGCA ACATTTGGGT GTGAAAGTAG GAGTGCCTGT 9360

CCTCACCTAG GTCCACGGGC ACAGGCCTGG GGATGGAGCC CCCGCCAGGG ACCCGCCCTT 9420

CTCTGCCCAG CACTTTTCTG CCCCCCTCCC TCTGGAACAC AGAGTGGCAG TTTCCACAAG 9480

CACTAAGCAT CCTCTTCCCA AAAGACCCAG CATTGGCACC CCTGGACATT TGCCCCACAG 9540

CCCTGGGAAT TCACGTGACT ACGCACATCA TGTACACACT CCCGTCCACG ACCGACCCCC 9600

GCTGTTTTAT TTTAATAGCT ACAAAGCAGG GAAATCCCTG CTAAAATGTC CTTTAACAAA 9660

CTGGTTAAAC AAACGGGTCC ATCCGCACGG TGGACAGTTC CTCACAGTGA AGAGGAACAT 9720

GCCGTTTATA AAGCCTGCAG GCATCTCAAG GGAATTACGC TGAGTCAAAA CTGCCACCTC 9780

CATGGGATAC GTACGCAACA TGCTCAAAAA GAAAGAATTT CACCCCATGG CAGGGGAGTG 9840

GTTGGGGGGT TAAGGACGGT GGGGGCAGCA GCTGGGGGCT ACTGCACGCA CCTTTTACTA 9900

AAGCCAGTTT CCTGGTTCTG ATGGTATTGG CTCAGTTATG GGAGACTAAC CATAGGGGAG 9960

TGGGGATGGG GGAACCCGGA GGCTGTGCCA TCTTTGCCAT GCCCGAGTGT CCTGGGCAGG 10020

ATAATGCTCT AGAGATGCCC ACGTCCTGAT TCCCCCAAAC CTGTGGACAG AACCCGCCCG 10080

GCCCCAGGGC CTTTGCAGGT GTGATCTCCG TGAGGACCCT GAGGTCTGGG ATCCTTCGGG 10140

ACTACCTGCA GGCCCGAAAA GTAATCCAGG GGTTCTGGGA AGAGGCGGGC AGGAGGGTCA 10200

GAGGGGGGCA GCCTCAGGAC GATGGAGGCA GTCAGTCTGA GGCTGAAAAG GGAGGGAGGG 10260

CCTCGAGCCC AGGCCTGCAA GCGCCTCCAG AAGCTGGAAA AAGCGGGGAA GGGACCCTCC 10320

ACGGAGCCTG CAGCAGGAAG GCACGGCTGG CCCTTAGCCC ACCAGGGCCC ATCGTGGACC 10380

TCCGGCCTCC GTGCCATAGG AGGGCACTCG CGCTGCCCTT CTAGCATGAA GTGTGTGGGG 10440

ATTTGCAGAA GCAACAGGAA ACCCATGCAC TGTGAATCTA GGATTATTTC AAAACAAAGG 10500

TTTACAGAAA CATCCAAGGA CAGGGCTGAA GTGCCTCCGG GCAAGGGCAG GGCAGGCACG 10560

AGTGATTTTA TTTAGCTATT TTATTTTATT TACTTACTTT CTGAGACAGA GTTATGCTCT 10620

TGTTGCCCAG GCTGGAGTGC AGCGGCATGA TCTTGGCTCA CTGCAACCTC CGTCTCCTGG 10680

GTTCAAGCAA TTCTCGTGCC TCAGCCTCCC AAGTAGCTGG GATTTCAGGC GTGCACCACC 10740

ACACCCGGCT AATTTTGTAT TTTTAGTAGA GATGGGCTTT CACCATGTTG GTCAGGCTGA 10800

TCTCAAAATC CTGACCTCAG GTGATCCGCC CACCTCAGCC TCCCAAAGTG CTGGGATTAC 10860

AGGCATGAGC CACTGCACCT GGCCTATTTA ACCATTTTAA AACTTCCCTG GGCTCAAGTC 10920

ACACCCACTG GTAAGGAGTT CATGGAGTTC AATTTCCCCT TTACTCAGGA GTTACCCTCC 10980

TTTGATATTT TCTGTAATTC TTCGTAGACT GGGGATACAC CGTCTCTTGA CATATTCACA 11040

GTTTCTGTGA CCACCTGTTA TCCCATGGGA CCCACTGCAG GGGCAGCTGG GAGGCTGCAG 11100

GCTTCAGGTC CCAGTGGGGT TGCCATCTGC CAGTAGAAAC CTGATGTAGA ATCAGGGCGC 11160

GAGTGTGGAC ACTGTCCTGA ATCTCAATGT CTCAGTGTGT GCTGAAACAT GTAGAAATTA 11220

AAGTCCATCC CTCCTACTCT ACTGGGATTG AGCCCCTTCC CTATCCCCCC CCAGGGGCAG 11280

AGGAGTTCCT CTCACTCCTG TGGAGGAAGG AATGATACTT TGTTATTTTT CACTGCTGGT 11340

ACTGAATCCA CTGTTTCATT TGTTGGTTTG TTTGTTTTGT TTTGAGAGGC GGTTTCACTC 11400

TTGTTGCTCA GGCTGGAGGG AGTGCAATGG CGCGATCTTG GCTTACTGCA GCCTCTGCCT 11460

CCCAGGTTCA AGTGATTCTC CTGCTTCCGC CTCCCATTTG GCTGGGATTA CAGGCACCCG 11520

CCACCATGCC CAGCTAATTT TTTGTATTTT TAGTAGAGAC GGGGGTGGGG GTGGGGTTCA 11580

CCATGTTGGC CAGGCTGGTC TCGAACTTCT GACCTCAGAT GATCCACCTG CCTCTGCCTC 11640

CTAAAGTGCT GGGATTACAG GTGTGAGCCA CCATGCCCAG CTCAGAATTT ACTCTGTTTA 11700

GAAACATCTG GGTCTGAGGT AGGAAGCTCA CCCCACTCAA GTGTTGTGGT GTTTTAAGCC 11760

AATGATAGAA TTTTTTTATT GTTGTTAGAA CACTCTTGAT GTTTTACACT GTGATGACTA 11820

AGACATCATC AGCTTTTCAA AGACACACTA ACTGCACCCA TAATACTGGG GTGTCTTCTG 11880

GGTATCAGCG ATCTTCATTG AATGCCGGGA GGCGTTTCCT CGCCATGCAC ATGGTGTTAA 11940

TTACTCCAGC ATAATCTTCT GCTTCCATTT CTTCTCTTCC CTCTTTTAAA ATTGTGTTTT 12000

CTATGTTGGC TTCTCTGCAG AGAACCAGTG TAAGCTACAA CTTAACTTTT GTTGGAACAA 12060

ATTTTCCAAA CCGCCCCTTT GCCCTAGTGG CAGAGACAAT TCACAAACAC AGCCCTTTAA 12120

AAAGGCTTAG GGATCACTAA GGGGATTTCT AGAAGAGCGA CCCGTAATCC TAAGTATTTA 12180

CAAGACGAGG CTAACCTCCA GCGAGCGTGA CAGCCCAGGG AGGGTGCGAG GCCTGTTCAA 12240

ATGCTAGCTC CATAAATAAA GCAATTTCCT CCGGCAGTTT CTGAAAGTAG GAAAGGTTAC 12300

ATTTAAGGTT GCGTTTGTTA GCATTTCAGT GTTTGCCGAC CTCAGCTACA GCATCCCTGC 12360

AAGGCCTCGG GAGACCCAGA AGTTTCTCGC CCCTTAGATC CAAACTTGAG CAACCCGGAG 12420

TCTGGATTCC TGGGAAGTCC TCAGCTGTCC TGCGGTTGTG CCGGGGCCCC AGGTCTGGAG 12480

GGGACCAGTG GCCGTGTGGC TTCTACTGCT GGGCTGGAAG TCGGGCCTCC TAGCTCTGCA 12540

GTCCGAGGCT TGGAGCCAGG TGCCTGGACC CCGAGGCTGC CCTCCACCCT GTGCGGGCGG 12600

GATGTGACCA GATGTTGGCC TCATCTGCCA GACAGAGTGC CGGGGCCCAG GGTCAAGGCC 12660

GTTGTGGCTG GTGTGAGGCG CCCGGTGCGC GGCCAGCAGG AGCGCCTGGC TCCATTTCCC 12720

ACCCTTTCTC GACGGGACCG CCCCGGTGGG TGATTAACAG ATTTGGGGTG GTTTGCTCAT 12780

GGTGGGGACC CCTCGCCGCC TGAGAACCTG CAAAGAGAAA TGACGGGCCT GTGTCAAGGA 12840

GCCCAAGTCG CGGGGAAGTG TTGCAGGGAG GCACTCCGGG AGGTCCCGCG TGCCCGTCCA 12900

GGGAGCAATG CGTCCTCGGG TTCGTCCCCA GCCGCGTCTA CGCGCCTCCG TCCTCCCCTT 12960

CACGTCCGGC ATTCGTGGTG CCCGGAGCCC GACGCCCCGC GTCCGGACCT GGAGGCAGCC 13020

CTGGGTCTCC GGATCAGGCC AGCGGCCAAA GGGTCGCCGC ACGCACCTGT TCCCAGGGCC 13080

TCCACATCAT GGCCCCTCCC TCGGGTTACC CCACAGCCTA GGCCGATTCG ACCTCTCTCC 13140

GCTGGGGCCC TCGCTGGCGT CCCTGCACCC TGGGAGCGCG AGCGGCGCGC GGGCGGGGAA 13200

GCGCGGCCCA GACCCCCGGG TCCGCCCGGA GCAGCTGCGC TGTCGGGGCC AGGCCGGGCT 13260

CCCAGTGGAT TCGCGGGCAC AGACGCCCAG GACCGCGCTT CCCACGTGGC GGAGGGACTG 13320

GGGACCCGGG CACCCGTCCT GCCCCTTCAC CTTCCAGCTC CGCCTCCTCC GCGCGGACCC 13380

CGCCCCGTCC CGACCCCTCC CGGGTCCCCG GCCCAGCCCC CTCCGGGCCC TCCCAGCCCC 13440

TCCCCTTCCT TTCCGCGGCC CCGCCCTCTC CTCGCGGCGC GAGTTTCAGG CAGCGCTGCG 13500

TCCTGCTGCG CACGTGGGAA GCCCTGGCCC CGGCCACCCC CGCGATGCCG CGCGCTCCCC 13560

GCTGCCGAGC CGTGCGCTCC CTGCTGCGCA GCCACTACCG CGAGGTGCTG CCGCTGGCCA 13620 

CGTTCGTGCG GCGCCTGGGG CCCCAGGGCT GGCGGCTGGT GCAGCGCGGG GACCCGGCGG 13680 

CTTTCCGCGC GCTGGTGGCC CAGTGCCTGG TGTGCGTGCC CTGGGACGCA CGGCCGCCCC 13740 

CCGCCGCCCC CTCCTTCCGC CAGGTGGGCC TCCCCGGGGT CGGCGTCCGG CTGGGGTTGA 13800

GGGCGGCCGG GGGGAACCAG CGACATGCGG AGAGCAGCGC AGGCGACTCA GGGCGCTTCC 13860

CCCGCAGGTG TCCTGCCTGA AGGAGCTGGT GGCCCGAGTG CTGCAGAGGC TGTGCGAGCG 13920

CGGCGCGAAG AACGTGCTGG CCTTCGGCTT CGCGCTGCTG GACGGGGCCC GCGGGGGCCC 13980 

CCCCGAGGCC TTCACCACCA GCGTGCGCAG CTACCTGCCC AACACGGTGA CCGACGCACT 14040

GCGGGGGAGC GGGGCGTGGG GGCTGCTGCT GCGCCGCGTG GGCGACGACG TGCTGGTTCA 14100 

CCTGCTGGCA CGCTGCGCGC TCTTTGTGCT GGTGGCTCCC AGCTGCGCCT ACCAGGTGTG 14160 

CGGGCCGCCG CTGTACCAGC TCGGCGCTGC CACTCAGGCC CGGCCCCCGC CACACGCTAG 14220 

TGGACCCCGA AGGCGTCTGG GATGCGAACG GGCCTGGAAC CATAGCGTCA GGGAGGCCGG 14280

GGTCCCCCTG GGCCTGCCAG CCCCGGGTGC GAGGAGGCGC GGGGGCAGTG CCAGCCGAAG 14340

TCTGCCGTTG CCCAAGAGGC CCAGGCGTGG CGCTGCCCCT GAGCCGGAGC GGACGCCCGT 14400 

TGGGCAGGGG TCCTGGGCCC ACCCGGGCAG GACGCGTGGA CCGAGTGACC GTGGTTTCTG 14460 

TGTGGTGTCA CCTGCCAGAC CCGCCGAAGA AGCCACCTCT TTGGAGGGTG CGCTCTCTGG 14520 

CACGCGCCAC TCCCACCCAT CCGTGGGCCG CCAGCACCAC GCGGGCCCCC CATCCACATC 14580 

GCGGCCACCA CGTCCCTGGG ACACGCCTTG TCCCCCGGTG TACGCCGAGA CCAAGCACTT 14640

CCTCTACTCC TCAGGCGACA AGGAGCAGCT GCGGCCCTCC TTCCTACTCA GCTCTCTGAG 14700 

GCCCAGCCTG ACTGGCGCTC GGAGGCTCGT GGAGACCATC TTTCTGGGTT CCAGGCCCTG 14760 

GATGCCAGGG ACTCCCCGCA GGTTGCCCCG CCTGCCCCAG CGCTACTGGC AAATGCGGCC 14820 

CCTGTTTCTG GAGCTGCTTG GGAACCACGC GCAGTGCCCC TACGGGGTGC TCCTCAAGAC 14880 

GCACTGCCCG CTGCGAGCTG CGGTCACCCC AGCAGCCGGT GTCTGTGCCC GGGAGAAGCC 14940 

CCAGGGCTCT GTGGCGGCCC CCGAGGAGGA GGACACAGAC CCCCGTCGCC TGGTGCAGCT 15000 

GCTCCGCCAG CACAGCAGCC CCTGGCAGGT GTACGGCTTC GTGCGGGCCT GCCTGCGCCG 15060 

GCTGGTGCCC CCAGGCCTCT GGGGCTCCAG GCACAACGAA CGCCGCTTCC TCAGGAACAC 15120 

CAAGAAGTTC ATCTCCCTGG GGAAGCATGC CAAGCTCTCG CTGCAGGAGC TGACGTGGAA 15180 

GATGAGCGTG CGGGACTGCG CTTGGCTGCG CAGGAGCCCA GGTGAGGAGG TGGTGGCCGT 15240 

CGAGGGCCCA GGCCCCAGAG CTGAATGCAG TAGGGGCTCA GAAAAGGGGG CAGGCAGAGC 15300 

CCTGGTCCTC CTGTCTCCAT CGTCACGTGG GCACACGTGG CTTTTCGCTC AGGACGTCGA 15360 

GTGGACACGG TGATCGAGTC GACTCCCTTT AGTGAGGGTT AATTGAGCTC GCGGCCGC 15418 

SEQ. ID NO: 2 (mTERT sequence, GenBank Accession AF121949) 

1 aagcttccag caaaccagtt agagctgagt tgatgctctg aagaagagaa aatgtagaga 
61 cggtactgaa caaataatgt ctgggcaaac ctcagacatg aaaatggaag acgtggaaat 
121 ccagagaact ctgagggaaa ataaaacaca actccaggtc atcacgggac tcatcaaact 
181 gctgaggtgc agccacagag aaaaatctta aaatagccta gaacgatgca tgacacataa 
241 agcacagaga agacgaagct gagtctgtct tgtaggaaca acttgagaag acctaaacca 
301 ctgcaatgag tgcattctgc taacttagaa tttgctaccc agttcagatc caaaaagggt 
361 ttcacaaagt tcaacacaaa acagtagcag gagtggctaa gggggacaca ctgataggaa 
421 ttcagagaag tagggaatgc tcatatgggg acattacaaa atgtactttc atgttgctta 
481 aatcatttta attgtcaacc acatcaagct aaataatgct ttgaggttca taacatttgg 
541 agattatgtc tacactagca gagaaggcac caataacatc ccaattgcta gattctcata 
601 gaatcatgag tcacaatggc agagacaggt tctgagagtg tgtccttgtt gtaaacagta 
661 tgctctacaa actaagttgg ctgcaatatc actaggcagt gttgtcccat aagacaacta 
721 tcacatatgt ggtccagtga tgaccaaagc atcttttagc attttgcaaa tgaagctcaa 
781 atcgaatatg actaagctca tgcagtacaa atcaaaggta cactgggata gtttaaaaga 
841 tacatacttg tactggttag ttttgtgtca gcttgacaca gctggagtta tcacagagaa 
901 aagagcttca gttgaggaaa ttcctccatg agatccagct atagggcatt ttctcaatta 
961 gtgatcaagg ggggaaggcc ccttgtgggt gggaccatct ctgggctggt agtcttggtt 
1021 ctataagaga gcaggctgag caagccagga gaagcaagcc agtaaagaac atccctccat 
1081 ggcttctgca tcagctcctg ctccctgacc tgcttgagtt ccagttctaa cttctttcag 
1141 tgatgaacag caatgtggaa atgaaagctg aataaaccct ttcctcccca ttttgcttct 
1201 tggtcatgat gtttgtgcag gaatagaaac cctgactaag acaatactat aaaccctaaa 
1261 agttgtaaac caaacacatg tgtttccatt aagccatcgt agaacaataa gtactcaacc 
1321 ccaagtcaca taactataat cccagccttt gaaaaccggg atcaggaatt caaggctagc 
1381 ctcatctata tgtaagatta aagcctgttt gggctgcatg agacttcgct tcaaaaaaaa 
1441 aaaaaaaaaa gcaaacaggc aaaaacaaac acaagacaag acagatgtaa aatgaaggag 
1501 gggtagatgg gtcaagtaga aaatagcata ggaaacgagt caagtataga agaggtggta 
1561 gtaaccagat catgcagaag gactcaaggc catctcctca cagtggctta ggtaggcctt 
1621 cctctgctct tgagcagggg cagagttgcc gctttaagga ggggatcagt cacctttaag 
1681 aactgaaaag ctgaacagtc ttctcaagtc agaagccagt ggcttcatct tacacctctc 
1741 ttccttccct tgctactcat attggatctg atgatttgcc caacttggaa gaaacatctc 
1801 ttctgaaggg tttcacagac accccatctt tccgagaaag gaccgcatag gctggccatc 
1861 cctgtgctta caaaaggaat aattaagaaa cttaattcca taagcaaata caacctttcc 
1921 aagccccaag tggatgattt tatcttactg tttttttata tctcatcaaa taacttccaa 
1981 gggctcaaaa atccaaagat gtaaaaaagg aactgagctc tgtttgccaa gccatgagga 
2041 ttaaataatg acattcaaag agatttttgt gccctaagta ctttttattg gttttcatag 
2101 acggtttaat gtgcaagatg aagcaaacag agatgggagt ggtatcagca tggattaagg 
2161 tggcagttgt gagggagggg tactgagaga acaggacaag gtaacctatc taaggagagg 
2221 ccaagttggc aagtgccagg gacttctaag cccagaacta gtacacattc cttaggtgct 
2281 gtttgggaag ccagggagtc accagccttg ggatctataa aagtgcatgg tggcattcac 
2341 tcacatactt cctgagctgt tcgatgttga tgaagtcgtg ggtatgagac tgttgtgtca 
2401 gtgacaaact atgtaaatga gaatgattgt ttccatcttg accactaaga cgtaaaccgg 
2461 ttccagtgat ctccaaacat ggcaagctac agcagagcag cagccccatc cagagccttg 




2521 ccctggttct gaatggggga gaatccagtg ggagtcggtt gctgccagca tgttggggta 
2581 gaaggctgga gcatgacagg tccccgagga tttcctgctt cctatatggg tagggatact 
2641 tgaggtcctc tcttctacct ccttccctgc agggtttata acctctac™ ctgEctgtct 
2701 ctgggatagc tcctagggtg cagcccctcc ccaaaaaggc ctctccctgg cctcatgtct 
,„,, c ^asaacag ctttctaaag caggcctgtt acacaaaggc tcccttttcc tggcttcatc 
WWW gttg = tggta 9 ac aa=ttcc actcgttttc cacttcagtt tcttctactc tgttgttatt 
llll tllWltVtl g " tgaaccc a 9g3ttgtgt agtcagcaag tgctaccccc tccctcctct 
2941 tctttgtttt tttgaggcag ggtctcattt tgcccaagtg gacctaaatt tcagcatgta 
3001 gctggcctgg ttttgaatgc cttctcatcc tgcctctact tcccaagagt agcttacaag 
3061 tgtgcaccac catgccccgc gatattctta tttttgagac tgttttctat gctggtttct
WW "? gggaact ^ctaaggt agcttacaag tgtgcaccac catgccccgc gatattctta 
WW " tctgagac tgttttctat gctggtttct ttggggaact acactaaggt agcttcattg 
3241 ttggcataaa tttctcagtt caggcccata tctcctaagt agcagaacL agcaaatctc 
3301 aaacaaaccc ctcaaaaaga ctgatgtcca ctaaacggac ttctaaaata gctcctgtaa
3«i ^^T^ ttacaaggcg scagacctcc tataagggag taaatatgaa aacgcgcctg 
3421 ttcaaatgct aggtcggtgg atagaagcaa tttcctcaga aagctgaagg caccaaaggt 
35" : : l g a Tt ttCag tgtttgccaa a ctcagctac agtagaga^c acagattccc 

IWW ^ attt "f ag ^ttcaaaa ttcagcagcc cctctctaac tatggctcag agtcgtgtca 
IWW ttacatatgc ==<= aa caaca acccccaccc ctatcctacc cccgcctcac acgtgcaagt 
3661 S^SS? ttgccaacct a 3= a 3 a actg ccatcctaag gtcgaggtcg ccgctttggc
3721 tgtgtgcaca ggcaagcgcc ctcacccaat ggccctggcc ttgctatggg tgcgtgagtt 
WWW ^" g f CgC ^tggactct gaggtgaagg ccactggaac agtgaaaaaa gctaacgcag 
3841 ta 3 tc ^" cctttggtgg tgggtgttta cggaacatat ttgggacctg

3901 agtgtatggt cgcaccacaa taaagcctta acctatatag tagaatttca gctgtaatca 
WWW " aa 3 aa "g a 9 a ttgccac cacccacctc actgtctgtg tcaaccacag caggctggag 
4021 cagtcagctc aggaacaggc aaaaccttag gtccctccgc ctacctaacc ttcaatacat 
ilW « a ^ ata " cttctttgct tgcccaaacc tcgccccagt ctagaccacc tggggattcc 
4141 cagctcaggg cgaaaaggaa gcccgagaag cattctgtag agggaaatcc tg?a?gagtg 
4201 cgcccccttt cgttactcca acacatccag caaccactga acttggccgg ggaacacacc 
4261 tggtcctcat gcaccagcat tgtgaccatc aacggaaaag tactactgc? gSaccccgc 
4321 tlttttllS aCaaCgC " 9 gtccgc «g a a tcccgcccc ttcctccgtt Lcagcctca
WWW ITWWrtn 3 , C9tggactct cagtBBectfl ggtcctggct gttttctaag cacacccttg 
AW = at " tggtt =«3= a cgtg ggaggcccat cccggccttg agcacaatga cccgcgctc? 
tlW WWWWWWWWW 9Cggtgcgct "ctgctgcg cagccgatac cgggaggtgt ggccgctggc 
WWW aa = C " tgtg =g9=3==tgg ggcccgaggg caggcggctt gtgcaacccg gggacccgL 
4621 gatctaccgc actttggttg cccaatgcct agtgtgcatg cactggggc? cacagcc^cc 
4681 acctgccgac ctttccttcc accaggtggg cctccaggcg ggatccccat gggtcagggg 
WbW lll^tlttn 99aggacgtg g3 a tagtgcg tctagctcat gtgtcaagac cctcttcicc 
4801 ttaccaggtg tcatccctga aagagctggt ggccagggtt gtgcagagac tctgcgagcg 
4861 caacgagaga aacgtgctgg cttttggctt tgagctgctt aacgaggcca gaggcgggc? 
till ™SS ttcactagta gcgtgcgtag ctacttgccc aacactgtta ctgagaccct 
4981 gcgtgtcagt ggtgcatgga tgctactgtt gagccgagtg ggcgacgacc tgctggtcta 
5041 cctgctggca cactgtgctc tttatcttct ggtgcccccc agctgtgcct accaggtgtg 
WW * gggtctccc ctgtaccaaa tttgtgccac cacggatatc tggccctctg tgtccgctag 
5161 ttacaggccc acccgacccg tgggcaggaa tttcactaac cttaggttct tacaacagat 
WW tt^ g T g W ag " gccagg aa 9c a ccgaa acccctggcc ttgccatctc gaggtacaaa 
5281 gaggcatctg agtctcacca gtacaagtgt gccttcagct aagaaggcca gatgctatcc 
lit, tgtcccgaga Stggaggagg gaccccacag gcaggtgcta ccaaccccat caggcaaatc 
5401 atgggtgcca agtcctgctc ggtcccccga ggtgcctact gcagagaaag atttgtette 
ttW ^ aaa " aaa9 9 ' gt " gacc tgagtctctc tgggtcggtg tgctgtaaac acaagcccag 
5521 ctccacatct ctgctgtcac caccccgcca aaatgccttt cagctcaggc catttattga 
WWW ! a ^ a9aC3t " cctttact =«ggggaga tggccaagag cgtctaaacc cctcattcct 
57M WW,TW,TWr.t "r a9CC " ac "g a «gg ggccaggaga ctggtggaga tcatctttct 
WW gggctcaagg cctaggacat caggaccact ctgcaggaca caccgtctat cgcgtcgata 
5761 ctggcagatg cggcccctgt tccaacagct gctggtgaac catgcagagt gccaatatgt 
5821 ZlTW, aggtCacatt g= a ggtttcg aacagcaaac caacaggtga cagatgcc?t
WW Wtt .t 9C " accgcacc teatgsattt gctccgcctg cacagcagtc cctggcaggt 
5941 atatggtttt cttcgggcct gtctctgcaa ggtggtgtct gctagtctct ggggtac?ag 
6001 gcacaatgag cgccgcttct ttaagaactt aaagaagttc atctcgttgg ggaaatacgg 
6061 caagctatca ctgcaggaac tgatgtggaa gatgaaagta gaggattgcc actggctccg 
6121 cagcagcccg ggtgagcatg gctggtctcc agctgaatgc attaggggcc cagaaaaggg 
WW agacaatggg tggcagtaac ccaggtcccc agtggtgtgg tggctttatg cagtccg?g 9 
6241 ttggatgagt tccatcttat ggtctctgac tccaagctcc ctccagctcg ccctgcacaa 
6301 actaagattc ttgtccaagc cctgggcagg ttctcagggc tggggacatt gtggtgaaca 
6361 gataagcaga cggggagcat ggtggatagg agttctggca cagtgcacca gagagagtct 
6421 ggaagcgcta gtgagagcta atgtaagggc ccgtggttcg ccaaagaatg ataaccccgg 
6481 actcaaatag tatgccaaag caaggagcat ttcattctgc agaaatcaag catgcaggtg 
6541 gggggggggg gttgctctca ttccaagatg gagagacaac caagtataga ttttaagggg 
WWW a » C9 ??" CC " tatCttac tccatctcta ggggcattcc attactgggg catgggg??g 
6661 gaggttggaa actgttaatg gggaggtctg gaaacttgct gccccattgt ccttgcttca 
WW g3Ctaggtag "gagtagct tctaatggca ggatagtttc tgactagctg tctaaagtct 
WWW ggggtgttt g "tttttgtt ttttctagta acttacttgc ctgaacttgc tcagttttta 
6841 lllllllltt ~ tggaCtgc = aa tttgaag cctattaagg agccagcccg tctcactact
6901 ccaggttatc tataatcccc ctgtagaacg gtacctcact gataacaatg acagaccaac 
6961 ataggaaccc actatccttg tggtgcatga gtttcaaagg ttcttctggt cctcccagtg 
7021 ^ agatC = a tg^taagct atggtcctcc cagtgtgcag atccgtgctt aagctatggt
7081 cttgcagctg ctcgatctac aaagggtagg gtgaacgaag gaaagataaa tgaaaaaaaa
7141 aaaactgttt cctacagtga agatcgctgc cccatcttag ctatgagaag ggactgggga
7201 gtggagcctg gtgcataaaa gaggattgtg ttacttggaa ggctgcagag cctggactcc
7261 tgtgccctcc ttgcctggtt ttctgggttt aatgttgagg ttggccctct gtagtcacta
7321 cctgacccct tccctttcag ccaaccctcc ggttacaccc tgtgcatgta tggaaggggc
7381 caaacgccct atcctgctcc cccttcccca aaattcttag gatattaaca acttatgggg
7441 aaaagatggt agagctatgt ttacccacca tgtacttggg aagctccgaa gtaagctt